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EP 1 262 564 A2 

Description 

Background of the Invention 

s [0001 ] Since the genetic information is represented by the sequence of the four DNA building blocks deoxyadenosine- 
(dpA), deoxyguanosine- (dpG), deoxycytidine-(dpC) and deoxythymidine-5'-phosphate (dpT), DNA sequencing is one 
of the most fundamental technologies in molecular biology and the life sciences in general. The ease and the rate by 
which DNA sequences can be obtained greatly affects related technologies such as development and production of 
new therapeutic agents and new and useful varieties of plants and microorganisms via recombinant DNA technology. 

10 In particular, unraveling the DNA sequence helps in understanding human pathological conditions including genetic 
disorders, cancer and AIDS. In some cases, very subtle differences such as a one nucleotide deletion, addition or 
substitution can create serious, in some cases even fatal, consequences. Recently, DNA sequencing has become the 
core technology of the Human Genome Sequencing Project (e.g., J.E. Bishop and M. Waldholz, 1991, Genome: The 
Story of the Most Astonishing Scientific Adventure of Our Time - The Attempt to Map All the Genes in the Human Body , 

*5 Simon & Schuster, New York). Knowledge of the complete human genome DNA sequence will certainly help to under- 
stand, to diagnose, to prevent and to treat human diseases. To be able to tackle successfully the determination of the 
approximately 3 billion base pairs of the human genome in a reasonable time frame and in an economical way, rapid, 
reliable, sensitive and inexpensive methods need to be developed, which also offer the possibility of automation. The 
present invention provides such a technology. 

20 [0002] Recent reviews of today's methods together with future directions and trends are given by Barrell (The FASEB 
Journal 5, 40-45 (1991)), and Trainor ( Anal. Chem. 62, 418-26(1990)). 

[0003] Currently, DNA sequencing is performed by either the chemical degradation method of Maxam and Gilbert 
( Methods in Enzymology 65, 499-560 (1 980)) or the enzymatic dideoxynucleotide termination method of Sanger etai 
( Proc. Natl. Acad. Sci. USA 74 , 5463-67 (1 977)). In the chemical method, base specific modifications result in a base 

25 specific cleavage of the radioactive or fluorescently labeled DNA fragment. With the four separate base specific cleav- 
age reactions, four sets of nested fragments are produced which are separated according to length by polyacrylamide 
gel electrophoresis (PAGE). After autoradiography, the sequence can be read directly since each band (fragment) in 
the gel originates from a base specific cleavage event. Thus, the fragment lengths in the four "ladders" directly translate 
into a specific position in the DNA sequence. 

30 [0004] In the enzymatic chain termination method, the four base specific sets of DNA fragments are formed by starting 
with a primer/template system elongating the primer into the unknown DNA sequence area and thereby copying the 
template and synthesizing a complementary strand by DNA polymerases, such as Klenow fragment of E. coli DNA 
polymerase I, a DNA polymerase from Thermus aquaticus, Taq DNA polymerase, or a modified T7 DNA polymerase, 
Sequenase (Tabor etai, Proc. Natl. Acad.Sci.USA 84 , 4767-4771 (1987)), in the presence of chain-terminating, rea- 

35 gents. Here, the chain-terminating event is achieved by incorporating into the four separate reaction mixtures in addition 
to the four normal deoxynucleoside triphosphates, dATP, dGTP, dTTP and dCTP, only one of the chain-terminating 
dideoxynucleoside triphosphates, ddATP, ddGTP, dcfTTP or ddCTP, respectively, in a limiting small concentration. The 
four sets of resulting fragments produce, after electrophoresis, four base specific ladders from which the DNA sequence 
can be determined. 

40 [0005] A recent modification of the Sanger sequencing strategy involves the degradation of phosphorothioate-con- 
taining DNA fragments obtained by using alpha-thio dNTP instead of the normally used ddNTPs during the primer 
extension reaction mediated by DNA polymerase (Labeit etai, DNA 5, 173-177 (1986); Amersham, PCT-Application 
GB86/00349; Eckstein et ai t Nucleic Acids Res . 1_6, 9947 (1988)). Here, the four sets of base-specific sequencing 
ladders are obtained by limited digestion with exonuclease III or snake venom phosphodiesterase, subsequent sepa- 
ls ration on PAGE and visualization by radioisotopic labeling of either the primer or one of the dNTPs. In a further mod- 
ification, the base-specific cleavage is achieved by alkylating the sulphur atom in the modified phosphodiester bond 
followed by a heat treatment (Max-Planck-Gesellschaft, DE 3930312 A1). Both methods can be combined with the 
amplification of the DNA via the Polymerase Chain Reaction (PCR). 

[0006] On the upfront end, the DNA to be sequenced has to be fragmented into sequencable pieces of currently not 
50 more than 500 to 1 000 nucleotides. Starting from a genome, this is a multi-step process involving cloning and subcloning 
steps using different and appropriate cloning vectors such as YAC, cosmids, plasmids and M13 vectors (Sambrook et 
a/., Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, 1 989). Finally, for Sanger sequenc- 
ing, the fragments of about 500 to 1000 base pairs are integrated into a specific restriction site of the replicative form 
I (RF I) of a derivative of the M13 bacteriophage (Vieria and Messing, Gene 19 , 259 (1982)) and then the double- 
55 stranded form is transformed to the single-stranded circular form to serve as a template for the Sanger sequencing 
process having a binding site for a universal primer obtained by chemical DNA synthesis (Sinha, Biernat, McManus 
and Koster, Nucleic Acids Res. 12 , 4539-57 (1 984); U.S. Patent No. 4725677 upstream of the restriction site into which 
the unknown DNA fragment has been inserted. Under specific conditions, unknown DNA sequences integrated into 
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supercoiled double-stranded plasmid DNA can be sequenced directly by the Sanger method (Chen and Seeburg, DNA 
4, 165-170(1985)) and Lim era/., Gene Anal. Techn, 5, 32-39 (1988), and, with the Polymerase Chain Reaction (PCR) 
( PCR Protocols: A Guide to Methods and Applications . Innis era/., editors, Academic Press, San Diego (1990)) cloning 
or subcloning steps could be omitted by directly sequencing off chromosomal DNA by first amplifying the DNA segment 
5 by PCR and then applying the Sanger sequencing method (Innis et a/., Proc, Natl, Acad, Sci. USA 85 , 9436-9440 
(1 988)). In this case, however, the DNA sequence in the interested region most be known at least to the extent to bind 
a sequencing primer. 

[0007] In order to be able to read the sequence from PAGE, detectable labels have,to be used in either the primer 
(very often at the 5'-end) or in one of the deoxynucleoside triphosphates, dNTP. Using radioisotopes such as 32 P, 33 P, 

10 or 35 S is still the most frequently used technique. After PAGE, the gels are exposed to X-ray films and silver grain 
exposure is analyzed. The use of radioisotopic labeling creates several problems. Most labels useful for autoradio- 
graphic detection of sequencing fragements have relatively short half-lives which can limit the useful time of the labels. 
The emission high energy beta radiation, particularly from 32 P, can lead to breakdown of the products via radtolysis so 
that the sample should be used very quickly after labeling. In addition, high energy radiation can also cause a deteri- 

15 oration of band sharpness by scattering. Some of these problems can be reduced by using the less energetic isotopes 
such as 33 P or 35 S (see, e.g., Ornstein era/., Biotechniques 3, 476 (1985)). Here, however, longer exposure times 
have to be tolerated. Above all, the use of radioisotopes poses significant health risks to the experimentalist and, in 
heavy sequencing projects, decontamination and handling the radioactive waste are other severe problems and bur- 
dens. 

20 [0008] In response to the above mentioned problems related to the use of radioactive labels, non-radioactive labeling 
techniques have been explored and, in recent years, integrated into partly automated DNA sequencing procedures. 
All these improvements utilize the Sanger sequencing strategy. The fluorescent label can be tagged to the primer 
(Smith et a/., Nature 321 , 674-679 (1986) and EPO Patent No. 87300998.9; Du Pont De Nemours EPO Application 
No. 0359225; Ansorge et al. J. Biochem. Biophys. Methods 13 , 325-32 (1 986)) or to the chain -terminating dideoxynu- 

25 closide triphosphates (Prober era/. Science 238 , 336-41 (1987); Applied Biosystems, PCT Application WO 91/05060). 
Based on either labeling the primer or the ddNTP, systems have been developed by Applied Biosystems (Smith era/., 
Science 235 , G89 (1 987); U.S. Patent Nos. 570973 and 68901 3), Du Pont De Nemours (Prober et af. Science 238 , 
336-341 (1987); U.S. Patents Nos. 881372 and 57566), Pharmacia-LKB (Ansorge et a/. Nucleic Acids Res , 15, 
4593-4602 (1987) and EMBL Patent Application DE P3724442 and P3805808.1) and Hitachi (JP 1-90844 and DE 

30 4011991 A1). A somewhat similar approach was developed by Brumbaugh et al. (Proc. Natl. Sci. USA 85 , 5610-14 
(1988) and U.S. Patent No. 4,729,947). An improved method for the Du Pont system using two electrophoretic lanes 
with two different specific labels per lane is described (PCT Application WO92/02635). A different approach uses flu- 
orescently labeled avidin and biotin labeled primers. Here, the sequencing ladders ending with biotin are reacted during 
electrophoresis with the labeled avidin which results in the detection of the individual sequencing bands (Brumbaugh 

35 et al, U.S. Patent No. 594676). 

[0009] More recently even more sensitive non-radioactive labeling techniques for DNA using chemiluminescence 
triggerable and amplifyable by enzymes have been developed (Beck, O'Keefe, Coull and Koster, Nucleic Acids Res. 
T7, 5115-5123 (1989) and Beck and Koster, Anal. Chem. 62, 2258-2270 (1990)). These labeling methods were com- 
bined with multiplex DNA sequencing (Church et al Science 240, 185-188 (1988) to provide for a strategy aimed at 

40 high throughput DNAsequencing (Koster et al., Nucleic Acids Res. Symposium Ser. No. 24, 318-321 (1991), University 
of Utah, PCT Application No. WO 90/15883); this strategy still suffers from the disadvantage of being very laborious 
and difficult to automate. 

[0010] In an attempt to simplify DNA sequencing, solid supports have been introduced. In most cases published so 
far, the template strand for sequencing (with or without PCR amplification) is immobilized on a solid support most 
45 frequently utilizing the strong biotin-avidin/streptavidin interaction (Orion- Yhtyma Oy, U.S. Patent No. 277643; M. Uhlen 
et al. Nucleic Acids Res. J_6, 3025-38 (1988); Cemu Bioteknik, PCT Application No. WO 89/09282 and Medical Re- 
search Council, GB, PCT Application No. WO 92/03575). The primer extension products synthesized on the immobi- 
lized template strand are purified of enzymes, other sequencing reagents and by-products by a washing step and then 
released under denaturing conditions by loosing the hydrogen bonds between the Watson-Crick base pairs and sub- 
so jected to PAGE separation. In a different approach, the primer extension products (not the template) from a DNA 
sequencing reaction are bound to a solid support via biotin/avidin (Du Pont De Nemours, PCT Application WO 
91/11533). In contrast to the above mentioned methods, here, the interaction between biotin and avidin is overcome 
by employing denaturing conditions (formamide/EDTA) to release the primer extension products of the sequencing 
reaction from the solid support for PAGE separation. As solid supports, beads, (e.g., magnetic beads (Dynabeads) 
55 and Sepharose beads), filters, capillaries, plastic dipsticks (e.g., polystyrene strips) and microliter wells are being 
proposed. 

[0011] All methods discussed so far have one central step in common: 

polyacrylamide gel electrophoresis (PAGE). In many instances, this represents a major drawback and limitation for 
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each of these methods. Preparing a homogeneous gel by polymerization, loading of the samples, the electrophoresis 
itself, detection of the sequence pattern (e.g., by autoradiography), removing the gel and cleaning the glass plates to 
prepare another gel are very laborious and time-consuming procedures. Moreover, the whole process is error-prone, 
difficult to automate, and, in order to improve reproducibility and reliability, highly trained and skilled personnel are 

5 required. In the case of radioactive labeling, autoradiography itself can consume from hours to days. In the case of 
fluorescent labeling, at least the detection of the sequencing bands is being performed automatically when using the 
laser-scanning devices integrated into commercial available DNA sequencers. One problem related to the fluorescent 
labeling is the influence of the four different base-specific fluorescent tags on the mobility of the fragments during 
electrophoresis and a possible overlap in the spectral bandwidth of the four specific dyes reducing the discriminating 

10 power between neighboring bands, hence, increasing the probability of sequence ambiguities. Artifacts are also pro- 
duced by base-specific interactions with the polyacrylamide gel matrix (Frank and Koster, Nucleic Acids Res. 6, 2069 
(1979)) and by the formation of secondary structures which result in "band compressions" and hence do not allow one 
to read the sequence. This problem has, in part, been overcome by using 7-deazadeoxyguanosine triphosphates (Barr 
et at., Biotechniques 4, 428 (1986)). However, the reasons for some artifacts and conspicuous bands are still under 

is investigation and need further improvement of the gel electrophoretic procedure. 

[0012] A recent innovation in electrophoresis is capillary zone electrophoresis (CZE) (Jorgenson etai, J. Chroma- 
tography 352 , 337 (1986); Gesteland et at, Nucleic Acids Res. 18, 1415-1419 (1990)) which, compared to slab gel 
electrophoresis (PAGE), significantly increases the resolution of the separation, reduces the time for an electrophoretic 
run and allows. the analysis of very small samples. Here, however, other problems arise due to the miniaturization of 

20 the whole system such as wall effects and the necessity of highly sensitive on-line detection methods. Compared to 
PAGE, another drawback is created by the fact that CZE is only a "one-lane" process, whereas in PAGE samples in 
multiple lanes can be electrophoresed simultaneously. 

[0013] Due to the severe limitations and problems related to having PAGE as an integral and central part in the 
standard DNA sequencing protocol, several methods have been proposed to do DNA sequencing without an electro- 
ns phoretic step. One approach calls for hybridization or fragmentation sequencing (Bains, Biotechnology 10 , 757-58 
(1992) and Mirzabekov etai, FEBS Letters 256 , 118-122 (1989)) utilizing the specific hybridization of known short 
oligonucleotides (e.g., octadeoxynucleotides which gives 65,536 different sequences) to a complementary DNA se- 
quence. Positive hybridization reveals a short stretch of the unknown sequence. Repeating this process by performing 
hybridizations with all possible octadeoxynucleotides should theoretically determine the sequence. In a completely 
30 different approach, rapid sequencing of DNA is done by unilaterally degrading one single, immobilized DNA fragment 
by an exonuclease in a moving flow stream and detecting the cleaved nucleotides by their specific fluorescent tag via 
laser excitation (Jett et at., J. Biomolecular Structure & Dynamics 7, 301-309, (1989); United States Department of 
Energy, PCT Application No. WO 89/03432). In another system proposed by Hyman (Anal. Biochem. 1 74, 423-436 
(1988)), the pyrophosphate generated when the correct nucleotide is attached to the growing chain on a primer-tem- 
35 plate system is used to determine the DNA sequence. The enzymes used and the DNA are held in place by solid 
phases (DEAE-Sepharose and Sepharose) either by ionic interactions or by covalent attachment. In a continuous flow- 
through system, the amount of pyrophosphate is determined via bioluminescence (luciferase). A synthesis approach 
to DNA sequencing is also used by Tsien etai (PCT Application No. WO 91/06678). Here, the incoming dNTP's are 
protected at the 3'-end by various blocking groups such as acetyl or phosphate groups and are removed before the 
40 next elongation step, which makes this process very slow compared to standard sequencing methods. The template 
DNA is immobilized on a polymer support. To detect incorporation, a fluorescent or radioactive label is additionally 
incorporated into the modified dNTP's. The same patent application also describes an apparatus designed to automate 
the process. 

[001 4] Mass spectrometry, in general, provides a means of "weighing" individual molecules by ionizing the molecules 
45 in vacuo and making them "fly" by volatilization. Under the influence of combinations of electric and magnetic fields, 
the ions follow trajectories depending on their individual mass (m) and charge (z). In the range of molecules with low 
molecular weight, mass spectrometry has long been part of the routine physical-organic repertoire for analysis and 
characterization of organic molecules by the determination of the mass of the parent molecular ion. In addition, by 
arranging collisions of this parent molecular ion with other particles (e.g., argon atoms), the molecular ion is fragmented 
50 forming secondary ions by the so-called collision induced dissociation (CID). The fragmentation pattern/pathway very 
often allows the derivation of detailed structural information. Many applications of mass spectrometric methods in the 
known in the art, particularly in biosciences, and can be found summarized in Methods in Enzymology , Vol. 1 93: "Mass 
Spectrometry" (J.A. McCloskey, editor), 1990, Academic Press, New York. 

[0015] Due to the apparent analytical advantages of mass spectrometry in providing high detection sensitivity, ac- 
55 curacy of mass measurements, detailed structural information by CID in conjunction with an MS/MS configuration and 
speed, as well as on-line data transfer to a computer, there has been considerable interest in the use of mass spec- 
trometry for the structural analysis of nucleic acids. Recent reviews summarizing this field include K. H. Schram, "Mass 
Spectrometry of Nucleic Acid Components, Biomedical Applications of Mass Spectrometry" 34, 203-287 (1990); and 
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P.F. Crain, "Mass Spectrometry Techniques in Nucleic Acid Research," Mass Spectrometry Reviews 9, 505-554 (1 990). 
The biggest hurdle to applying mass spectrometry to nucleic acids is the difficulty of volatilizing these very polar bi- 
opolymers. Therefore, "sequencing" has been limited to low molecular weight synthetic oligonucleotides by determining 
the mass of the parent molecular ion and through this, confirming the already known sequence, or alternatively, con- 

5 firming the known sequence through the generation of secondary ions (fragment ions) via CID in an MS/MS configu- 
ration utilizing, in particular, for the ionization and volatilization, the method of fast atomic bombardment (FAB mass 
spectrometry) or plasma desorption (PD mass spectrometry). As an example, the application of FAB to the analysis 
of protected dimeric blocks for chemical synthesis of oligodeoxynucleotides has been described (Koster et al. Biomed- 
ical Environmental Mass Spectrometry 14 , 111-116(1987)). 

10 [0016] Two more recent ionization/desorption techniques are electrospray/ionspray (ES) and matrix-assisted laser 
desorption/ionization (MALDI). ES mass spectrometry has been introduced by Fenn et al. ( J.Phys.Chem . 88 , 4451-59 
(1984); PCT Application No. WO 90/14148) and current applications are summarized in recent review articles (R.D. 
Smith etal., Anal. Chem. 62 , 882-89 (1990) and B. Ardrey, Electrospray Mass Spectrometry, Spectroscopy Europe , 
4, 10-18 (1992)). The molecular weights of the tetradecanucleotide d ( C ATGCC ATGG C ATG ) (SEQ ID NO:1) (Covey 

is et al. 'The Determination of Protein, Oligonucleotide and Peptide Molecular Weights by lonspray Mass Spectrometry," 
Rapid Communications in Mass Spectrometry , 2, 249-256 (1 988)), of the 21-mer d(AAATTGTGCACATCCTGCAGC) 
(SEQ ID NO:2) and without giving details of that of a tRNA with 76 nucleotides ( Methods in Enzymology , 193 , "Mass 
Spectrometry" (McCloskey, editor), p. 425, 1 990, Academic Press, New York) have been published. As a mass analyzer, 
a quadrupole is most frequently used. The determination of molecular weights in femtomole amounts of sample is very 

20 accurate due to the presence of multiple ion peaks which all could be used for the mass calculation. 

[0017] MALDI mass spectrometry, in contrast, can be particularly attractive when a time-of -flight (TOF) configuration 
is used as a mass analyzer The MALDI-TOF mass spectrometry has been introduced by Hillenkamp et al. ("Matrix 
Assisted UV-Laser Desorption/ionization: A New Approach to Mass Spectrometry of Large Biomolecules," Biological 
Mass Spectrometry (Buriingame and McCloskey, editors), Elsevier Science Publishers, Amsterdam, pp. 49-60, 1990.) 

25 Since, in most cases, no multiple molecular ion peaks are produced with this technique, the mass spectra, in principle, 
look simpler compared to ES mass spectrometry. Although DNA molecules up to a molecular weight of 410,000 daltons 
could be desorbed and volatilized (Williams et a/., "Volatilization of High Molecular Weight DNA by Pulsed Laser Ablation 
of Frozen Aqueous Solutions," Science, 246 , 1585-87 (1989)), this technique has so far only been used to determine 
the molecular weights of relatively small oligonucleotides of known sequence, e.g., oligothymidylic acids up to 18 

30 nucleotides (Huth-Fehre et al., "Matrix-Assisted Laser Desorption Mass Spectrometry of Oligodeoxythymidylic Acids," 
Rapid Communications in Mass Spectrometry , 6, 209-1 3 (1 992)) and a double-stranded DNA of 28 base pairs (Williams 
et al., "Time-of-Flight Mass Spectrometry of Nucleic Acids by Laser Ablation and Ionization from a Frozen Aqueous 
Matrix," Rapid Communications in Mass Spectrometry , 4, 348-351 (1 990)). In one publication (Huth-Fehre et al., 1 992 , 
supra), it was shown that a mixture of all the oligothymidylic acids from n=12 to n=18 nucleotides could be resolved. 

35 [0018] In U.S. Patent No. 5,064,754, RNA transcripts extended by DNA both of which are complementary to the 
DNA to be sequenced are prepared by incorporating NTP's, dNTP's and, as terminating nucleotides, ddNTP's which 
are substituted at the 5'-positton of the sugar moiety with one or a combination of the 
isotopes 12 C, 13 C, 14 C, 1 H, 2 H, 3 H, 16 0, 17 0 and 1s O. The polynucleotides obtained are degraded to 3'-nucleotides, 
cleaved at the N-glycosidic linkage and the isotopically labeled 5' -functionality removed by periodate oxidation and the 

40 resulting formaldehyde species determined by mass spectrometry. A specific combination of isotopes serves to dis- 
criminate base-specifically between internal nucleotides originating from the incorporation of NTP's and dNTP's and 
terminal nucleotides caused by linking ddNTP's to the end of the polynucleotide chain. A series of RNA/DNA fragments 
is produced, and in one embodiment, separated by electrophoresis, and, with the aid of the so-called matrix method 
of analysis, the sequence is deduced. 

45 [0019] In Japanese Patent No. 59-131909, an instrument is described which detects nucleic acid fragments sepa- 
rated either by electrophoresis, liquid chromatography or high speed gel filtration. Mass spectrometric detection is 
achieved by incorporating into the nucleic acids atoms which normally do not occur in DNA such as S, Br, I or Ag, Au, 
Pt, Os, Hg. The method, however, is not applied to sequencing of DNA using the Sanger method. In particular, it does 
not propose a base-specific correlation of such elements to an individual ddNTP. 

so [0020] PCTApplicationNo.W0 89/12694(Brennan etai, Proc. SPIE-lnt. Soc. Opt. Eng. 1 206, (NewTechnol. Cytom. 
Mol. Biol. ), pp. 60-77 (1990); and Brennan, U.S. Patent No. 5,003,059) employs the Sanger methodology for DNA 
sequencing by using a combination of either the four stable isotopes 32 S, 33 S, ^S, 36 S or 35 CI, 37 CI, 79 Br, 81 Br to spe- 
cifically label the chain -terminating ddNTP's. The sulfur isotopes can be located either in the base or at the alpha- 
position of the triphosphate moiety whereas the halogen isotopes are located either at the base or at the 3'-position of 

55 the sugar ring. The sequencing reaction mixtures are separated by an electrophoretic technique such as C2E, trans- 
ferred to a combustion unit in which the sulfur isotopes of the incorporated ddNTP's are transformed at about 900°C 
in an oxygen atmosphere. The SO s generated with masses of 64, 65, 66 or 68 is determined on-line by mass spec- 
trometry using, e.g., as mass analyzer, a quadrupole with a single ion-multiplier to detect the ion current. 
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[0021] A similar approach is proposed in U.S. Patent No. 5,002,868 (Jacobson etal., Proc. SPIE-lnt, Soc, Opt, Eng. 
1435, (Opt. Methods Ultrasensitive Detect. Anal. Tech. Appl. ), 26-35 (1991)) using Sanger sequencing with four 
ddNTP's specifically substituted at the alpha-position of the triphosphate moiety with one of the four stable sulfur iso- 
topes as described above and subsequent separation of the four sets of nested sequences by tube gel electrophoresis. 
5 The only difference is the use of resonance ionization spectroscopy (RIS) in conjunction with a magnetic sector mass 
analyzer as disclosed in U.S. Patent No. 4,442,354 to detect the sulfur isotopes corresponding to the specific nucleotide 
terminators, and by this, allowing the assignment of the DNA sequence. 

[0022] EPO Patent Applications No. 0360676 A1 and 0360677 A1 also describe Sanger sequencing using stable 
isotope substitutions in the ddNTP's such as D, 13 C, 15 N, 17 0, 18 O p 32 S, ^S, 34 S, ^S, 19 F, 35 CI, 37 CI, 79 Br, 81 Br 

10 and 127 l or functional groups such as CF 3 or Si(CH 3 ) 3 at the base, the sugar or the alpha position of the triphosphate 
moiety according to chemical functionality. The Sanger sequencing reaction mixtures are separated by tube gel elec- 
trophoresis. The effluent is converted into an aerosol by the eiectrospray/thermospray nebulizer method and then 
atomized and ionized by a hot plasma (7000 to 8000°K) and analyzed by a simple mass analyzer. An instrument is 
proposed which enables one to automate the analysis of the Sanger sequencing reaction mixture consisting of tube 

*5 electrophoresis, a nebulizer and a mass analyzer. 

[0023] The application of mass spectrometry to perform DNA sequencing by the hybridization/fragment method (see 
above) has been recently suggested (Bains, "DNA Sequencing by Mass Spectrometry: Outline of a Potential Future 
Application," Chimicaoggi -9, 13-16(1991)). 

20 Summary of the Invention 


[0024] The invention describes a new method to sequence DNA, The improvements over the existing DNA sequenc- 
ing technologies include high speed, high throughput, no required electrophoresis (and, thus, no gel reading artifacts 
due to the complete absence of an electrophoretic step), and no costly reagents involving various substitutions with 

25 stable isotopes. The invention utilizes the Sanger sequencing strategy and assembles the sequence information by 
analysis of the nested fragments obtained by base-specific chain termination via their different molecular masses using 
mass spectrometry, for example, MALDI or ES mass spectrometry. A further increase in throughput can be obtained 
by introducing mass modifications in the oligonucleotide primer, the chain-terminating nucleoside triphosphates and/ 
or the chain-elongating nucleoside triphosphates, as well as using integrated tag sequences which allow multiplexing 

30 by hybridization of tag specific probes with mass differentiated molecular weights. 


Brief Description of the FIGURES 


[0025] 


FIGURE 1 is a representation of a process to generate the samples to be analyzed by mass spectrometry. This 
process entails insertion of a DNA fragment of unknown sequence into a cloning vector such as derivatives of 
M 1 3, pUC or phagemids; transforming the double-stranded f orm into the single-stranded form; performing the four 
Sanger sequencing reactions; linking the base-specifically terminated nested fragment family temporarily to a solid 
support; removing by a washing step all by-products; conditioning the nested DNA or RNA fragments by, for ex- 
ample, cation-ion exchange or modification reagent and presenting the immobilized nested fragments either di- 
rectly to mass spectrometric analysis or cleaving the purified fragment family off the support and evaporating the 
cleavage reagent. 

FIGURE 2A shows the Sanger sequencing products using ddTTP as terminating deoxynucleoside triphosphate 
of a hypothetical DNA fragment of 50 nucleotides (SEQ ID NO:3) in length with approximately equally balanced 
base composition. The molecular masses of the various chain terminated fragments are given. 
FIGURE 2B shows an idealized mass spectrum of such a DNA fragment mixture. 

FIGURES 3A and 3B show, in analogy to FIGURES 2A and 2B, data for the same model sequence (SEQ ID NO: 
3) with ddATP as chain terminator. 

FIGURES 4A and 4B show data, analogous to FIGURES 2A and 2B when ddGTP is used as a chain terminator 
for the same model sequence (SEQ ID NO:3). 

FIGURES 5A and 5B illustrate the results obtained where chain termination is performed with ddCTP as a chain 
terminator, in a similar way as shown in FIGURES 2A and 2B for the same model sequence (SEQ ID NO:3). 
FIGURE 6 summarizes the results of FIGURES 2A to 5B, showing the correlation of molecular weights of the 
nested four fragment families to the DNA sequence (SEQ ID NO:3). 

FIGURES 7A and 7B illustrate the general structure of mass-modified sequencing nucleic acid primers or tag 
sequencing probes for either Sanger DNA or Sanger RNA sequencing. 

FIGURES 8A and 8B show the general structure for the mass-modified triphosphates for either Sanger DNA or 
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Sanger RNA sequencing. General formulas of the chain-elongating and the chain-terminating nucleoside triphos- 
phates are demonstrated. 

FIGURE 9 outlines various linking chemistries (X) with either polyethylene glycol or terminally monoalkylated pol- 
yethylene glycol (R) as an example. 
5 FIGURE 10 illustrates similar linking chemistries as shown in FIGURES 8A and BB and depicts various mass 

modifying moieties (R). 

FIGURE 1 1 outlines how multiplex mass spectrometric sequencing can work using the mass-modified nucleic acid 
primer (UP). 

FIGURE 12 shows the process of multiplex mass spectrometric sequencing employing mass-modified chain-elon- 
10 gating and/or terminating nucleoside triphosphates. 

FIGURE 13 shows multiplex mass spectrometric sequencing by involving the hybridization of mass-modified tag 
sequence specific probes. 

FIGURE 14 shows a MALDI-TOF spectrum of a mixture of oligothymidylic acids, d(pT) 12-18. 

FIGURE 15 shows a superposition of MALDI-TOF spectra of the 50. mer d(TAACGGTCATTACGGCCATTGACTG- 

15 TAGGACCTGCATTACATGACTAGCT) (SEQ ID NO:3) (500 fmol) and dT(pdT) 99 (500 fmol). 

FIGURES 1 6A-1 6M show the MALDI-TOF spectra of all 1 3 DNA sequences representing the nested dT-terminated 
fragments of the Sanger DNA sequencing simulation of Figure 2, 500 fmol each, as follows: 16A is a 7-mer; 16B 
is a 1 0-mer; 1 6C is a 1 1 -mer; 1 6D is a 1 9-mer; 1 6E is a 20-mer; 1 6F is a 24-mer; 1 6G is a 26-mer; 1 6H is a 33-mer; 
1 61 is a 37-mer; 1 6J is a 38-mer; 1 6K is a 42-mer; 1 6L is a 46-mer and 1 6M is a 50-mer. 

20 FIGURES 1 7A and 1 7B show the superposition of the spectra of FIGURE 1 6. The two panels show two different 

scales and the spectra analyzed at that scale. Figure 1 7A shows the superposition of the spectra of 1 6A-1 6F. The 
letter above each peak corresponds to the original spectra of the fragment in FIGURE 16. For example, peak B 
corresponds to FIGURE 16B; peak C corresponds to FIGURE 16C, etc. 

FIGURE 1 B shows the superimposed MALDI-TOF spectra from MALDl-MS analysis of mass-modified oligonucle- 
25 otides as described in Example 21 . 

FIGURE 19 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) 
through a strong electrostatic interaction. 

FIGURES 20A and 20B illustrate various linking chemistries between the solid support (P) and the nucleic acid 
primer (NA) through a charge transfer complex of a charge transfer acceptor (A) and a charge transfer donor (D). 
30 FIGURE 21 illustrates various linking chemistries between the solid support (P) and the nucleic acid primer (NA) 

through a stable organic radical. 

FIGURE 22 illustrates a possible linking chemistry between the solid support (P) and the nucleic acid primer (NA) 
through Watson-Crick base pairing. 

FIGURE 23 illustrates linking the solid support (P) and the nucleic acid primer (NA) through a photolytically cleav- 
es able bond. 

Detailed Description of the Invention 

[0026] This invention describes an improved method of sequencing DNA. In particular, this invention employs mass 
40 spectrometry, such as matrix-assisted laser desorption/ionization (MALDI) or electrospray (ES) mass spectrometry 
(MS), to analyze the Sanger sequencing reaction mixtures. 

[0027] In Sanger sequencing, four families of chain-terminated fragments are obtained. The mass difference per 

nucleotide addition is 289.19 for dpC, 313.21 for dpA, 329.21 for dpG and 304.2 for dpT, respectively. 

[0028] In one embodiment, through the separate determination of the molecular weights of the four base-specifically 

4 5 terminated fragment families, the DNA sequence can be assigned via superposition (e.g., interpolation) of the molecular 
weight peaks of the four individual experiments. In another embodiment, the molecular weights of the four specifically 
terminated fragment families can be determined simultaneously by MS, either by mixing the products of all four reactions 
run in at least two separate reaction vessels (i.e., all run separately, or two together, or three together) or by running 
one reaction having all four chain-terminating nucleotides (e.g., a reaction mixture comprising dTTP, ddTTP, dATP, 

50 ddATP, dCTP, ddCTP, dGTP, ddGTP) in one reaction vessel. By simultaneously analyzing all four base-specifically 
terminated reaction products, the molecular weight values have been, in effect, interpolated. Comparison of the mass 
difference measured between fragments with the known masses of each chain-terminating nucleotide allows the as- 
signment of sequence to be carried out. In some instances, it may be desirable to mass modify, as discussed below, 
the chain-terminating nucleotides so as to expand the difference in molecular weight between each nucleotide. It will 

55 be apparent to those skilled in the art when mass-modification of the chain-terminating nucleotides is desirable and 
can depend, for instance, on the resolving ability of the particular spectrometer employed. By way of example, it may 
be desirable to produce four chain -terminating nucleotides, ddTTP, ddCTP 1 , ddATP 2 and ddGTP 3 where ddCTP 1 , 
ddATP 2 and ddGTP 3 have each been mass-modified so as to have molecular weights resolvable from one another by 
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the particular spectrometer being used. 

[0029] The terms chain-elongating nucleotides and chain-terminating nucleotides are well known in the art. For DNA, 
chain-elongating nucleotides include 2'-deoxyribonucleotides and chain-terminating nucleotides include 2', 3'-dideox- 
yribonucleotides. For RNA, chain-elongating nucleotides include ribonucelotides and chain -terminating nucleotides 
s include 3'-deoxyribonucleotides. The term nucleotide is also well known in the art. For the purposes of this invention, 
nucleotides include nucleoside mono-, di-, and triphosphates. Nucleotides also include modified nucleotides such as 
phosphorothioate nucleotides. 

[0030] Since mass spectrometry is a serial method, in contrast to currently used slab gel electrophoresis which allows 
several samples to be processed in parallel, in another embodiment of this invention, a further improvement can be 

10 achieved by multiplex mass spectrometry DNA sequencing to allow simultaneous sequencing of more than one DNA 
or RNA fragment. As described in more detail below, the range of about 300 mass units between one nucleotide addition 
can be utilized by employing either mass-modified nucleic acid sequencing primers or chain-elongating and/or termi- 
nating nucleoside triphosphates so as to shift the molecular weight of the base-specifically terminated fragments of a 
particular DNA or RNA species being sequenced in a predetermined manner. For the first time, several sequencing 

*5 reactions can be mass spectrometrically analyzed in parallel. In yet another embodiment of this invention, multiplex 
mass spectrometric DNA sequencing can be performed by mass modifying the fragment families through specific 
oligonucleotides (tag probes) which hybridize to specific tag sequences within each of the fragment families. In another 
embodiment, the tag probe can be covalently attached to the individual and specific tag sequence prior to mass spec- 
trometry. 

20 [0031] In one embodiment of the invention, the molecular weight values of at least two base-specifically terminated 
fragments are determined concurrently using mass spectrometry. The molecular weight values of preferably at least 
five and more preferably at least ten base-specifically terminated fragments are determined by mass spectrometry. 
Also included in the invention are determinations of the molecular weight values of at least 20 base-specifically termi- 
nated fragments and at least 30 base-specifically terminated fragments. Further, the nested base-specifically termi- 

25 nated fragments in a specific set can be purified of all reactants and by-products but are not separated from one another. 
The entire set of nested base-specifically terminated fragments is analyzed concurrently and the molecular weight 
values are determined. At least two base-specifically terminated fragments are analyzed concurrently by mass spec- 
trometry when the fragments are contained in the same sample. 

[0032] In general, the overall mass spectrometric DNA sequencing process will start with a library of small genomic 

30 fragments obtained after first randomly or specifically cutting the genomic DNA into large pieces which then, in several 
subcloning steps, are reduced in size and inserted into vectors like derivatives of M13 or pUC (e.g., M13mp18 or 
M13mp19) (see FIGURE 1). In a different approach, the fragments inserted in vectors, such as M13, are obtained via 
subcloning starting with a cDNA library. In yet another approach, the DNA fragments to be sequenced are generated 
by the polymerase chain reaction (e.g., Higuchi era/., "A General Method of in vitro Preparation and Mutagenesis of 

35 DNA Fragments: Study of Protein and DNA Interactions," Nucleic Acids Res. , 1j>, 7351 -67 (1 988)). As is known in the 
art, Sanger sequencing can start from one nucleic acid primer (UP) binding to the plus-strand or from another nucleic 
acid primer binding to the opposite minus-strand. Thus, either the complementary sequence of both strands of a given 
unknown DNA sequence can be obtained (providing for reduction of ambiguity in the sequence determination) or the 
length of the sequence information obtainable from one clone can be extended by generating sequence information 

40 from both ends of the unknown vector-inserted DNA fragment. 

[0033] The nucleic acid primer carries, preferentially at the 5'-end, a linking functionality, L, which can include a 
spacer of sufficient length and which can interact with a suitable functionality, L\ on a solid support to form a reversible 
linkage such as a photocleavable bond. Since each of the four Sanger sequencing families starts with a nucleic acid 
primer (L-UP; FIGURE 1 ) this fragment family can be bound to the solid support by reacting with functional groups, L', 

45 on the surface of a solid support and then intensively washed to remove all buffer salts, triphosphates, enzymes, 
reaction by-products, etc. Furthermore, for mass spectrometric analysis, it can be of importance at this stage to ex- 
change the cation at the phosphate backbone of the DNA fragments in order to eliminate peak broadening due to a 
heterogeneity in the cations bound per nucleotide unit. Since the L-U linkage is only of a temporary nature with the 
purpose to capture the nested Sanger DNA or RNA fragments to properly condition them for mass spectrometric 

50 analysis, there are different chemistries which can serve this purpose. In addition to the examples given in which the 
nested fragments are coupled covalently to the solid support, washed, and cleaved off the support for mass spectro- 
metric analysis, the temporary linkage can be such that it is cleaved under the conditions of mass spectrometry, i.e., 
a photocleavable bond such as a charge transfer complex or a stable organic radical. Furthermore, the linkage can be 
formed with L' being a quaternary ammonium group (some examples are given in FIGURE 1 9). In this case, preferably, 

55 the surface of the solid support carries negative charges which repel the negatively charged nucleic acid backbone 
and thus facilitates desorption. Desorption will take place either by the heat created by the laser pulse and/or, depending 
on L t ' by specific absorption of laser energy which is in resonance with the U chromophore (see, e.g., examples given 
in FIGU RE 1 9). The functionalities, L and L,' can also form a charge transfer complex and thereby form the temporary 
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L-L' linkage. Various examples for appropriate functionalities with either acceptor or donator properties are depicted 
without limitation in FIGURES 20A and 20B. Since in many cases the "charge-transfer band" can be determined by 
UV/vis spectrometry (see e.g. organic Charge Transfer Complexes by R. Foster, Academic Press, 1969), the laser 
energy can be tuned to the corresponding energy of the charge-transfer wavelength and, thus, a specific desorption 
s off the solid support can be initiated. Those skilled in the art will recognize that several combinations can serve this 
purpose and that the donor functionality can be either on the solid support or coupled to the nested Sanger DNA/RNA 
fragments or vice versa. 

[0034] In yet another approach, the temporary linkage L-L' can be generated by homolytically forming relatively stable 
radicals as exemplified in FIGURE 21. In example 4 of FIGURE 21, a combination of the approaches using charge- 

10 transfer complexes and stable organic radicals is shown. Here, the nested Sanger DNA/RNA fragments are captured 
via the formation of a charge transfer complex. Under the influence of the laser pulse, desorption (as discussed above) 
as well as ionization will take place at the radical position. In the other examples of FIGURE 21 under the influence of 
the laser pulse, the L-L' linkage will be cleaved and the nested Sanger DNA/RNA fragments desorbed and subsequently 
ionized at the radical position formed. Those skilled in the art will recognize that other organic radicals can be selected 

15 and that, in relation to the dissociation energies needed to homolytically cleave the bond between them, a corresponding 
laser wavelength can be selected (see e.g. Reactive Molecules by C. Wentrup, John Wiley & Sons, 1984). In yet 
another approach, the nested Sanger DNA/RNA fragments are captured via Watson-Crick base pairing to a solid 
support-bound oligonucleotide complementary to either the sequence of the nucleic acid primer or the tag oligonucle- 
otide sequence (see FIGURE 22). The duplex formed will be cleaved under the influence of the laser pulse and des- 

20 orption can be initiated. The solid support-bound base sequence can be presented through natural oligoribo- or oligo- 
deoxyribonucleotide as well as analogs (e.g. thio-modified phosphodiester or phosphotriester backbone) or employing 
oligonucleotide mimetics such as PNA analogs (see e.g. Nielsen et a!., Science , 254, 1497 (1991)) which render the 
base sequence less susceptible to enzymatic degradation and hence increases overall stability of the solid support- 
bound capture base sequence. With, appropriate bonds, L-L', a cleavage can be obtained directly with a laser tuned 

25 to the energy necessary for bond cleavage. Thus, the immobilized jnested Sanger fragments can be directly ablated 
during mass spectrometric analysis. 

[0035] To increase mass spectrometric performance, it may be necessary to modify the phosphodiester backbone 
prior to MS analysis. This can be accomplished by, for example, using alpha-thio modified nucleotides for chain elon- 
gation and termination. With alkylating agents such as akyliodides, iodoacetamide, p-iodoethanol, 2,3-epoxy-1-propa- 

30 nol (see FIGURE 1 0), the monothio phosphodiester bonds of the nested Sanger fragments are transformed into phos- 
photriester bonds. Multiplexing by mass modification in this case is obtained by mass-modifying the nucleic acid primer 
(UP) or the nucleoside triphosphates at the sugar or the base moiety. To those skilled in the art, other modifications of 
the nested Sanger fragments can be envisioned. In one embodiment of the invention, the linking chemistry allows one 
to cleave off the so-purified nested DNA enzymatically, chemically or physically. By way of example, the L-L' chemistry 

35 can be of a type of disulfide bond (chemically cleavable, for example, by mercaptoethanol or dithioerythrol), a biotin/ 
streptavidin system, a heterobifunctional derivative of a trityl ether group (Koster et a/., "A Versatile Acid-Labile Linker 
for Modification of Synthetic Biomolecules," Tetrahedron Letters 31 , 7095 (1990)) which can be cleaved under mildly 
acidic conditions, a levulinyl group cleavable under almost neutral conditions with a hydrazinium/acetate buffer, an 
arginine-arginine or lysine-lysine bond cleavable by an endopeptidase enzyme like trypsin or a pyrophosphate bond 

40 cleavable by a pyrophosphatase, a photocleavable bond which can be, for example, physically cleaved and the like 
(see, e.g., FIGURE 23). Optionally, another cation exchange can be performed prior to mass spectrometric analysis. 
In the instance that an enzyme-cleavable bond is utilized to immobilize the nested fragments, the enzyme used to 
cleave the bond can serve as an internal mass standard during MS analysis. 

[0036] The purification process and/or ion exchange process can be carried out by a number of other methods instead 
^5 of, or in conjunction with, immobilization on a solid support. For example, the base-specifically terminated products 
can be separated from the reactants by dialysis, filtration (including ultrafiltration), and chromatography. Likewise, these 
techniques can be used to exchange the cation of the phosphate backbone with a counter-ion which reduces peak 
broadening. 

[0037] The base-specifically terminated fragment families can be generated by standard Sanger sequencing using 
50 the Large Klenow fragment of E. coli DNA polymerase I, by Sequenase, Taq DNA polymerase and other DNA polymer- 
ases suitable for this purpose, thus generating nested DNA fragments for the mass spectrometric analysis. It is, how- 
ever, part of this invention that base-specifically terminated RNA transcripts of the DNA fragments to be sequenced 
can also be utilized for mass spectrometric sequence determination. In this case, various RNA polymerases such as 
the SP6 or the T7 RNA polymerase can be used on appropriate vectors containing, for example, the SP6 or the T7 
55 promoters (e.g. Axelrod etal, "Transcription from Bacteriophage T7 and SP6 RNA Polymerase Promoters in the Pres- 
ence of 3'-Deoxyribonucleoside 5'-triphosphate Chain Terminators," Biochemistry 24 , 5716-23 (1985)). In this case, 
the unknown DNA sequence fragments are inserted downstream from such promoters. Transcription can also be ini- 
tiated by.a nucleic acid primer (Pitulle etaf., "Initiator Oligonucleotides for the Combination of Chemical and Enzymatic 
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RNA Synthesis," Gene 112 , 101-105 (1 992)) which carries, as one embodiment of this invention, appropriate linking 
functionalities, L, which allow the immobilization of the nested RNA fragments, as outlined above, prior to mass spec- 
trometry analysis for purification and/or appropriate modification and/or conditioning. 

[0038] Forthis immobilization process of the DNA/RNA sequencing products for mass spectrometric analysis, various 
5 solid supports can be used, e.g., beads (silica gel, controlled pore glass, magnetic beads, Sephadex/Sepharose beads, 
cellulose beads, etc.), capillaries, glass fiber filters, glass surfaces, metal surfaces or plastic material. Examples of 
useful plastic materials include membranes in filter or microtiter plate formats, the latter allowing the automation of the 
purification process by employing microtiter plates which, as one embodiment of the invention, carry a permeable 
membrane in the bottom of the well functionalized with L\ Membranes can be based on polyethylene, polypropylene, 
to polyamide, polyvinylidenedifluoride and the like. Examples of suitable metal surfaces include steel, gold, silver, alumi- 
num, and copper. After purification, cation exchange, and/or modification of the phosphodiester backbone of the L-L' 
bound nested Sanger fragments, they can be cleaved off the solid support chemically, enzymatically or physically. Also, 
the L-L' bound fragments can be cleaved from the support when they are subjected to mass spectrometric analysis by 
using appropriately chosen L-L' linkages and corresponding laser energies/intensities as described above and in FIG- 
'S URES 19-23. 

[0039] The highly purified, four base-specifically terminated DNA or RNA fragment families are then analyzed with 
regard to their fragment lengths via determination of their respective molecular weights by MALDI or ES mass spec- 
trometry. 

[0040] For ES, the samples, dissolved in water or in a volatile buffer, are injected either continuously or discontinu- 
ed ously into an atmospheric pressure ionization interface (API) and then mass analyzed by a quadrupole. With the aid 
of a computer program, the molecular weight peaks are searched for the known molecular weight of the nucleic acid 
primer (UP) and determined which of the four chain-terminating nucleotides has been added to the UP. This represents 
the first nucleotide of the unknown sequence. Then, the second, the third, the n th extension product can be identified 
in a similar manner and, by this, the nucleotide sequence is assigned. The generation of multiple ion peaks which can 
25 be obtained using ES mass spectrometry can increase the accuracy of the mass determination. 

[0041] In MALDI mass spectrometry, various mass analyzers can be used, e.g., magnetic sector/magnetic deflection 
instruments in single or triple quadrupole mode (MS/MS), Fourier transform and time-of-f light (TOF) configurations as 
is known in the art of mass spectrometry. FIGURES 2A through 6 are given as an example of the data obtainable when 
sequencing a hypothetical DNA fragment of 50 nucleotides in length (SEQ ID NO:3) and having a molecular weight of 
30 1 5,344.02 daltons. The molecular weights calculated for the ddT (FIGURES 2A and 2B), ddA (FIGURES 3A and 3B) t 
ddG (FIGURES 4A and 4B) and ddC (FIGURES 5A and 5B) terminated products are given (corresponding to fragments 
of SEQ ID NO:3) and the idealized four MALDI-TOF mass spectra shown. All four spectra are superimposed, and from 
this, the DNA sequence can be generated. This is shown in the summarizing FIGURE 6, demonstrating how the mo- 
lecular weights are correlated with the DNA sequence. MALDI-TOF spectra have been generated for the ddT terminated 
35 products (FIGU RES 1 6A-1 6M) corresponding to those shown in FIGURE 2 and these spectra have been superimposed 
(FIGURES 1 7A and 1 7B). The correlation of calculated molecular weights of the ddT fragments and their experimen- 
tally-verified weights are shown in Table 1. Likewise, if all four chain -terminating reactions are combined and then 
analyzed by mass spectrometry, the molecular weight difference between two adjacent peaks can be used to determine 
the sequence. For the desorption/ionization process, numerous matrix/laser combinations can be used. 

40 


TABLE I 


50 


Correlation of calculated and experimentally verified molecular weights of the 13 DNA fragments of FIGURES 2 


and 16A-16M. 


Fragment (n-mer) 

calculated mass 

experimental mass 

difference 

7-mer 

2104.45 

2119.9 

+15.4 

10-mer 

3011.04 

3026.1 

+15.1 

11-mer 

3315.24 

3330.1 

+14.9 

19-mer 

5771.82 

5788.0 

+16.2 

20-mer 

6076.02 

6093.8 

+17.8 

24-mer 

7311.82 

7374.9 

+63.1 

26-mer 

' 7945.22 

7960.9 

+15.7 

33-mer 

10112.63 

10125.3 

+12.7 

37-mer 

11348.43 

11361.4 

+13.0 

38-mer 

11652.62 

11670.2 

+17.6 

42-mer 

12872.42 

12888.3 

+15.9 
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TABLE I (continued) 


Correlation of calculated and experimentally verified molecular weights of the 13 DNA fragments of FIGURES 2 


and 16A-16M. 


Fragment (n-mer) 

calculated mass 

experimental mass 

difference 

46-mer 

14108.22 

14125.0 

+16.8 

50-mer 

15344.02 

15362.6 

+18.6 


10 [0042] In order to increase throughput to a level necessary for high volume genomic and cDN A sequencing projects, 
a further embodiment of the present invention is to utilize multiplex mass spectrometry to simultaneously determine 
more than one sequence. This can be achieved by several, albeit different, methodologies, the basic principle being 
the mass modification of the nucleic acid primer (UP), the chain-elongating and/orterminating nucleoside triphosphates, 
' or by using mass-differentiated tag probes hybridizable to specific tag sequences. The term "nucleic acid primer" as 

15 used herein encompasses primers for both DNA and RNA Sanger sequencing. 

[0043] By way of example, FIGURE 7A presents a general formula of the nucleic acid primer (UP) and the tag probes 
(TP). The mass modifying moiety can be attached, for instance, to either the 5-end of the oligonucleotide (M 1 ), to the 
nucleobase (or bases) (M 2 , M 7 ),to the phosphate backbone (M 3 ), andtothe2'-position of the nucleoside (nucleosides) 
(M 4 , M 6 ) or/and to the terminal 3-position (M 5 ). Primer length can vary between 1 and 50 nucleotides in length. For 

20 the priming of DNA Sanger sequencing, the primer is preferentially in the range of about 1 5 to 30 nucleotides in length. 
For artificially priming the transcription in a RNA polymerase-mediated Sanger sequencing reaction, the length of the 
primer is preferentially in the range of about 2 to 6 nucleotides. If a tag probe (TP) is to hybridize to the integrated tag 
sequence of a family chain-terminated fragments, its preferential length is about 20 nucleotides. 
[0044] The table in FIGURE 7B depicts some examples of mass-modified primer/tag probe configurations for DNA, 

25 as well as RNA, Sanger sequencing. This list is, however, not meant to be limiting, since numerous other combinations 
of mass-modifying functions and positions within the oligonucleotide molecule are possible and are deemed part of 
the invention. The mass-modifying functionality can be, for example, a halogen, an azido, or of the type, XR, wherein 
X is a linking group and R is a mass-modifying functionality. The mass-modifying functionality can thus be used to 
introduce defined mass increments into the oligonucleotide molecule. 

30 [0045] In another embodiment, the nucleotides used for chain-elongation and/or termination are mass-modified. 
Examples of such modified nucleotides are shown in FIGURE 8A and 8B. Here the mass-modifying moiety, M, can be 
attached either to the nucleobase, M 2 (in case of the c 7 -deazanucleosides also to C-7, M 7 ), to the triphosphate group 
at the alpha phosphate, M 3 , or to the 2'-pdsition of the sugar ring of the nucleoside triphosphate, M 4 and M 6 . Further- 
more, the mass-modifying functionality can be added so as to affect chain termination, such as by attaching it to the 

35 3' -position of the sugar ring in the nucleoside triphosphate, M 5 . The list in FIGURE 8B represents examples of possible 
configurations for generating chain-terminating nucleoside triphosphates for RNA or DNA Sanger sequencing. For 
those skilled in the art, however, it is clear that many other combinations can serve the purpose of the invention equally 
well. In the same way, those skilled in the art will recognize that chain-elongating nucleoside triphosphates can also 
be mass-modified in a similar fashion with numerous variations and combinations in functionality and attachment po- 

40 sitions. 

[0046] Without limiting the scope of the invention, FIGURE 9 gives a more detailed description of particular examples 
of how the mass-modification, M, can be introduced for X in XR as well as using oligo-/polyethylene glycol derivatives 
for R. The mass-modifying increment in this case is 44, i.e. five different mass-modified species can be generated by 
just changing m from 0 to 4 thus adding mass units of 45 (m=0), 89 (m=1), 133 (m=2), 177 (m=3) and 221 (m=4) to 

45 the nucleic acid primer (UP), the tag probe (TP) or the nucleoside triphosphates respectively. The oligo/polyethylene 
glycols can also be monoalkylated by a lower alkyl such as methyl, ethyl, propyl, isopropyl, t-butyl and the like. A 
selection of linking functionalities, X, are also illustrated. Other chemistries can be used in the mass-modified com- 
pounds, as for example, those described recently in Oligonucleotides and Analogues, A Practical Approach , F. Eckstein, 
editor, IRL Press, Oxford, 1991 . 

50 [0047] In yet another embodiment, various mass-modifying functionalities, R, other than oligo/polyethylene glycols, 
can be selected and attached via appropriate linking chemistries, X. Without any limitation, some examples are given 
in FIGURE 10. A simple mass-modification can be achieved by substituting H for halogens like F, CI, Br and/or l t or 
pseudohalogens such as SCN, NCS, or by using different alkyl, aryl or aralkyl moieties such as methyl, ethyl, propyl, 
isopropyl, t-butyl, hexyl, phenyl, substituted phenyl, benzyl, or functional groups such as CH 2 F, CHF 2 , CF 3 , Si(CH 3 ) 3 , 

55 Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 H 5 ) 3 . Yet another mass-modification can be obtained by attaching homo- or 
heteropeptides throughX to the UP, TP or nucleoside triphosphates. One example useful in generating mass-modified 
species with a mass increment of 57 is the attachment of oligoglycines, e.g., mass-modifications of 74 (r=1, m=0), 
131 (r=1, m=2), 188 (r=1, m=3), 245 (r=1, m=4) are achieved. Simple oligoamides also can be used, e.g., mass- 
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modifications of 74 (r=1 , m=0), 88 (r=2, m=0), 102 (r=3, m=0), 116 (r=4, m=0) t etc. are obtainable. For those skilled in 
the art, it will be obvious that there are numerous possibilities in addition to those given in FIGURE 10 and the above 
mentioned reference ( Oligonucleotides and Analogues , F. Eckstein, 1 991), for introducing, in a predetermined manner, 
many different mass-modifying functionalities to UP, TP and nucleoside triphosphates which are acceptable for DNA 

.5 and RNA Sanger sequencing. 

[0048] As used herein, the superscript 0-i designates i + 1 mass differentiated nucleotides, primers or tags. In some 
instances, the superscript 0 (e.g., NTP 0 , UP 0 ) can designate an unmodified species of a particular reactant, and the 
superscript i (e.g., NTP', NTP 1 , NTP 2 , etc.) can designate the i-th mass-modified species of that reactant. If, for example, 
more than one species of nucleic acids (e.g., DNA clones) are to be concurrently sequenced by multiplex DNA se- 

10 quencing, then i + 1 different mass-modified nucleic acid primers (UP 0 , UP 1 -. ..UP') can be used to distinguish each set 
of base-specifically terminated fragments, wherein each species of mass-modified UP' can be distinguished by mass 
spectrometry from the rest. 

[0049] As illustrative embodiments of this invention, three different basic processes for multiplex mass spectrometric 
DNA sequencing employing the described mass-modified reagents are described below; 

15 

A) Multiplexing by the use of mass-modified nucleic acid primers (UP) for Sanger DNA or RNA sequencing (see 
for example FIGURE 11); 

B) Multiplexing by the use of mass-modified nucleoside triphosphates as chain elongators and/or chain terminators 
for Sanger DNA or RNA sequencing (see for example FIGURE 12); and 

20 C) Multiplexing by the use of tag probes which specifically hybridize to tag sequences which are integrated into 

part of the four Sanger DNA/RNA base-specifically terminated fragment families. Mass modification here can be 
achieved as described for FIGURES 7A, 7B, 9 and 10, or alternately, by designing different oligonucleotide se- 
quences having the same or different length with unmodified nucleotides which, in a predetermined way, generate 
appropriately differentiated molecular weights (see for example FIGURE 13). 

25 

[0050] The process of multiplexing by mass-modified nucleic acid primers (UP) is illustrated by way of example in 
' FIGURE 11 for mass analyzing four different DNA clones simultaneously. The first reaction mixture is obtained by 
standard Sanger DNA sequencing having unknown DNA fragment 1 (clone 1) integrated in an appropriate vector (e. 
g., M13mp1 8). employing an unmodified nucleic acid primer UP 0 , and a standard mixture of the four unmodified 

30 deoxynucleoside triphosphates, dNTP 0 , and with 1/1 0th of one of the four dideoxynucleoside triphosphates, ddNTP 0 . 
A second reaction mixture for DNA fragment 2 (clone 2) is obtained by employing a mass-modified nucleic acid primer 
UP 1 and, as before, the four unmodified nucleoside triphosphates, dNTP 0 , containing in each separate Sanger reaction 
1/1 0th of the chain-terminating unmodified dideoxynucleoside triphosphates ddNTP 0 . In the other two experiments, 
the four Sanger reactions have the following compositions: DNA fragment 3 (clone 3), UP 2 , dNTP 0 , ddNTP 0 and DNA 

35 fragment 4 (clone 4), UP 3 , dNTP 0 , ddNTP 0 . For mass spectrometric DNA sequencing, all base-specifically terminated 
reactions of the four clones are pooled and mass analyzed. The various mass peaks belonging to the four dideoxy- 
terminated (e.g.. ddT-terminated) fragment families are assigned to specifically elongated and ddT-terminated frag- 
ments by searching (such as by a computer program) for the known molecular ion peaks of UP 0 , UP 1 , UP 2 and UP 3 
extended by either one of the four dideoxynucleoside triphosphates, UP°-ddN°, UP 1 -ddN°, UP 2 -ddN° and UP 3 -ddN°. 

40 In this way, the first nucleotides of the four unknown DNA sequences of clone 1 to 4 are determined. The process is 
repeated, having memorized the molecular masses of the four specific first extension products, until the four sequences 
are assigned. Unambiguous mass/sequence assignments are possible even in the worst case scenario in which the 
four mass-modified nucleic acid primers are extended by the same dideoxynucleoside triphosphate, the extension 
products then being, for example, UP°-ddT, UP 1 -ddT, UP 2 -ddT and UP 3 -ddT, which differ by the known mass increment 

45 differentiating the four nucleic acid primers. In another embodiment of this invention, an analogous technique is em- 
ployed using different vectors containing, for example, the SP6 and/or T7 promoter sequences, and performing tran- 
scription with the nucleic acid primers UP 0 , UP 1 , UP 2 and UP 3 and either an RNA polymerase (e.g., SP6 or T7 RNA 
polymerase) with chain-elongating and terminating unmodified nucleoside triphosphates NTP 0 and 3'-dNTP° Here, 
the DNA sequence is being determined by Sanger RNA sequencing. 

so [0051] FIGURE 12 illustrates the process of multiplexing by mass-modified chain-elongating or/and terminating nu- 
cleoside triphosphates in which three different DNA fragments (3 clones) are mass analyzed simultaneously. The first 
DNA Sanger sequencing reaction (DNA fragment 1, clone 1) is the standard mixture employing unmodified nucleic 
acid primer UP 0 , dNTP 0 and in each of the four reactions one of the four ddNTP 0 . The second (DNA fragment 2, clone 
2) and the third (DNA fragment 3, clone 3) have the following contents: UP 0 , dNTP 0 , ddNTP 1 and UP 0 , dNTP 0 , ddNTP 2 , 

55 respectively. In a variation of this process, an amplification of the mass increment in mass-modifying the extended 
DNA fragments can be achieved by either using an equally mass-modified deoxynucleoside triphosphate (i.e., dNTP 1 , 
dNTP 2 ) for chain elongation alone or in conjunction with the homologous equally mass-modified dideoxynucleoside 
triphosphate. For the three clones depicted above, the contents of the reaction mixtures can be as follows: either UP 0 / 
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dNTP°/ddNTP°, UP°/dNTP 1 /ddNTP° and UP°/dNTP 2 /ddNTP° or UP°/dNTP°/ddNTP 0 , UP°/dNTP 1 /ddNTP 1 and UP 0 / 
dNTP 2 /ddNTP 2 , As described above, DNA sequencing can be performed by Sanger RNA sequencing employing un- 
modified nucleic acid primers, UP 0 , and an appropriate mixture of chain-elongating and terminating nucleoside triphos- 
phates. The mass-modification can be again either in the chain -terminating nucleoside triphosphate alone or in con- 

5 junction with mass -modified chain-elongating nucleoside triphosphates. Multiplexing is achieved by pooling the three 
base-specifically terminated sequencing reactions (e.g., the ddTTP terminated products) and simultaneously analyzing 
the pooled products by mass spectrometry. Again, the first extension products of the known nucleic acid primer se- 
quence are assigned, e.g., via a computer program. Mass/sequence assignments are possible even in the worst case 
in which the nucleic acid primer is extended/terminated by the same nucleotide, e.g., ddT, in all three clones. The 

10 following configurations thus obtained can be well differentiated by their different mass-modifications: UP 0 -ddT°, UP°- 
ddT 1 , UP°-ddT2. 

[0052] I n yet another embodiment of this invention, DNA sequencing by multiplex mass spectrometry can be achieved 
by cloning the DNA fragments to be sequenced in "pi ex-vectors" containing vector specific "tag sequences" as de- 
scribed (Koster et at., "Oligonucleotide Synthesis and Multiplex DNA Sequencing Using Chemiluminescent Detection," 

15 Nucleic Acids Res. Symposium Ser. No. 24, 318-321 (1991)); then pooling clones from different plex-vectors for DNA 
preparation and the four separate Sanger sequencing reactions using standard dNTP°/ddNTP° and nucleic acid primer 
UP 0 ; purifying the four multiplex fragment families via linking to a solid support through the linking group, L, at the 5'- 
end of UP; washing out all by-products, and cleaving the purified multiplex DNA fragments off the support or using the 
L-L 1 bound nested Sanger fragments as such for mass spectrometric analysis as described above; performing demul- 

20 tiplexing by one-by-one hybridization of specific "tag probes"; and subsequently analyzing by mass spectrometry (see, 
for example, FIGURE 1 3). As a reference point, the four base-specifically terminated multiplex DNA fragment families 
are run by the mass spectrometer and all ddT 0 -, ddA 0 -, ddC°- and ddG°-terminated molecular ion peaks are respectively 
detected and memorized. Assignment of, for example, ddT 0 -terminated DNA fragments to a specific fragment family 
is accomplished by another mass spectrometric. analysis after hybridization of the specific tag probe (TP) to the cor- 

25 responding tag sequence contained in the sequence of this specific fragment family. Only those molecular ion peaks 
which are capable of hybridizing to the specific tag probe are shifted to a higher molecular mass by the same known 
mass increment (e.g. of the tag probe). These shifted ion peaks, by virtue of all hybridizing to a specific tag probe, 
belong to the same fragment family. For a given fragment family, this is repeated for the remaining chain terminated 
fragment families with the same tag probe to assign the complete DNA sequence. This process is repeated i-1 times 

30 corresponding to i clones multiplexed (the i-th clone is identified by default). 

[0053] The differentiation of the tag probes for the different multiplexed clones can be obtained just by the DNA 
sequence and its ability to Watson-Crick base pair to the tag sequence. It is well known in the art how to calculate 
stringency conditions to provide for specific hybridization of a given tag probe with a given tag sequence (see, for 
example, Molecular Cloning: A laboratory manual 2ed, ed. by Sambrook, Fritsch and Maniatis (Cold Spring Harbor 

35 Laboratory Press: NY, 1989, Chapter 11). Furthermore, differentiation can be obtained by designing the tag sequence 
for each plex-vector to have a sufficient mass difference so as to be unique just by changing the length or base com- 
position or by mass-modifications according to FIGURES 7A, 7B, 9 and 10. In order to keep the duplex between the 
tag sequence and the tag probe intact during mass spectrometric analysis, it is another embodiment of the invention 
to provide for a covalent attachment mediated by, for example, photoreactive groups such as psoralen and ellipticine 

40 and by other methods known to those skilled in the art (see, for example, Helene et at., Nature 344 , 358 (1990) and 
Thuong et a/. "Oligonucleotides Attached to Intercalators, Photoreactive and Cleavage Agents" in F. Eckstein, Oligo- 
nucleotides and Analogues: A Practical Approach , IRL Press, Oxford 1991 , 283-306). 

[0054] The DNA sequence is unraveled again by searching for the lowest molecular weight molecular ion peak 
corresponding to the known UP°-tag sequence/tag probe molecular weight plus the first extension product, e.g., ddT 0 , 

45 then the second, the third, etc. 

[0055] In a combination of the latter approach with the previously described multiplexing processes, a further increase 
in multiplexing can be achieved by using, in addition to the tag probe/tag sequence interaction, mass-modified nucleic 
acid primers (FIGURES 7A and 7B) and/or mass-modified deoxynucleoside, dNTP 0- ', and/or dideoxynucleoside tri- 
phosphates, ddNTP 0-1 . Those skilled in the art will realize that the tag sequence/tag probe multiplexing approach is 

50 not limited to Sanger DNA sequencing generating nested DNA fragments with DNA polymerases. The DNA sequence, 
can also be determined by transcribing the unknown DNA sequence from appropriate promoter-containing vectors 
(see above) with various RNA polymerases and mixtures of NTP^'/S'-dNTP 0 *', thus'generating nested RNA fragments. 
[0056] In yet another embodiment of this invention, the mass-modifying functionality can be introduced by a two or 
multiple step process. In this case, the nucleic acid primer, the chain-elongating or terminating nucleoside triphosphates 

55 and/or the tag probes are, in a first step, modified by a precursor functionality such as azido, -N 3 , or modified with a 
functional group in which the R in XR is H (FIGURES 7A, 7B, 9) thus providing temporary functions, e.g., but not limited 
to -OH,-NH 2 , -NHR, -SH, -NCS, -OCO(CH 2 ) r COOH (r= 1 -20), -NHCO(CH 2 ) r COOH (r = 1 -20), -OS0 2 OH, -OCO(CH 2 ) r f 
(r = 1 -20), -OP(0-Alkyl)N(Alkyl)2. These less bulky functionalities result in better substrate properties for the enzymatic 
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DNA or RNA synthesis reactions of the DNA sequencing process. The appropriate mass-modifying functionality is then 
introduced after the generation of the nested base-specifically terminated DNA or RNA fragments prior to mass spec- 
trometry. Several examples of compounds which can serve as mass-modifying functionalities are depicted in FIGURES 

9 and 10 without limiting the scope of this invention. 

5 - [0057] Another aspect of this invention concerns kits for sequencing nucleic acids by mass spectrometry which in- 
clude combinations of the above-described sequencing reactants. For instance, in one embodiment, the kit comprises 
reactants for multiplex mass spectrometric sequencing of several different species of nucleic acid. The kit can include 
a solid support having a linking functionality (L 1 ) for immobilization of the base-specifically terminated products; at least 
one nucleic acid primer having a linking group (L) for reversibly and temporarily linking the primer and solid support 

10 through, for example, a photocleavable bond; a set of chain-eiongating nucleotides (e.g., dATP, dCTP, dGTP and dTTP, 
or ATP, CTP, GTP and UTP); a set of chain-terminating nucleotides (such as 2\3'-dideoxynucleotides for DNA synthesis 
or 3'-deoxynucleotides for RNA synthesis); and an appropriate polymerase for synthesizing complementary nucle- 
otides. Primers and/or terminating nucleotides can be mass-modified so that the base-specifically terminated fragments 
generated from one of the species of nucleic acids to be sequenced can be distinguished by mass spectrometry from 

15 all of the others. Alternative to the use of mass-modified synthesis reactants, a set of tag probes (as described above) 
can be included in the kit. The kit can also include appropriate buffers as welt as instructions for performing multiplex 
mass spectrometry to concurrently sequence multiple species of nucleic acids. 

[0058] In another embodiment, a nucleic acid sequencing kit can comprise a solid support as described above, a 
primer for initiating synthesis of complementary nucleic acid fragments, a set of chain-elongating nucleotides and an 
20 appropriate polymerase. The mass-modified chain-terminating nucleotides are selected so that the addition of one of 
the chain terminators to a growing complementary nucleic acid can be distinguished by mass spectrometry. 

EXAMPLE 1 

25 Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric 
analysis via disulfide bonds. 

[0059] As a solid support, Sequelon membranes (Millipore Corp., Bedford, MA) with phenyl isothiocyanate groups 
are used as a starting material. The membrane disks, with a diameter of 8 mm, are wetted with a solution of N-meth- 

30 ylmorpholine/water/2-propanol (NMM solution) (2/49/49 v/v/v), the excess liquid removed with filter paper and placed 
on a piece of plastic film or aluminum foil located on a heating block set to 55°C. A solution of 1 mM 2-mercaptoethyl- 
amine (cysteamine) or 2 t 2'-dithio-bis(ethylamine) (cystamine) or S-(2-thiopyridyl)-2-thio-ethylamine (10 ul, 10 nmpl) 
in NMM is added per disk and heated at 55°C. After 15 min, 10 ul of NMM solution are added per disk and heated for 
another 5 min. Excess of isothiocyanate groups may be removed by treatment with. 10ulofa10mM solution of glycine 

35 in NMM solution. For cystamine, the disks are treated with 10 ul of a solution of 1M aqueous dithiothreitol (DTT)/ 
2-propanol (1 :1 v/v) for 15 min at room temperature. Then, the disks are thoroughly washed in a filtration manifold with 
5 aliquots of 1 ml each of the NMM solution, then with 5 aliquots of 1 ml acetonitrile/water (l/l v/v) and subsequently 
dried. If not used immediately the disks are stored with free thiol groups in a solution of 1M aqueous dithiothreitol/ 
2-propanol (1:1 v/v) and, before use, DTT is removed by three washings with 1 ml each of the NMM solution. The 

40 primer oligonucleotides with 5'-SH functionality can be prepared by various methods (e.g., B.C.F Chu et at., Nucleic 
Acids Res. 14, 5591-5603 (1986), Sproat et a/., Nucleic Acids Res. 15, 4837-48 (1987) and Oligonucleotides and 
Analogues: A Practical Approach (F. Eckstein, editor), IRL Press Oxford, 1991). Sequencing reactions according to 
the Sanger protocol are performed in astandard way (e.g., H. Swerdlow era/., Nucleic Acids Res. 18 , 1415-19 (1990)). 
In the presence of about 7-10 mM DTT the free 5'-thiol primer can be used; in other cases, the SH functionality can 

45 be protected, e.g., by a trityl group during the Sanger sequencing reactions and removed prior to anchoring to the 
support in the following way. The four sequencing reactions (150 ul each in an Eppendorf tube) are terminated by a 

10 min incubation at 70°C to denature the DNA polymerase (such as Klenow fragment, Sequenase) and the reaction 
mixtures are ethanol precipitated. The supematants are removed and the pellets vortexed with 25 ul of an 1 M aqueous 
silver nitrate solution, and after one hour at room temperature, 50 ul of an 1M aqueous solution of DTT is added and 

50 mixed by vortexing. After 1 5 min, the mixtures are centrifuged and the pellets are washed twice with 1 00 ul ethylacetate 
by vortexing and centrifugation to remove excess DTT. The primer extension products with free S'-thiol group are now 
coupled to the thiolated membrane supports under mild oxidizing conditions. In general, it is sufficient to add the 5'- 
thiolated primer extension products dissolved in 10 ul 10 mM de-aerated triethylammonium acetate buffer (TEAA) pH 
7.2 to the thiolated membrane supports. Coupling is achieved by drying the samples onto the membrane disks with a 

55 cold fan. This process can be repeated by wetting the membrane with 1 0 ul of 1 0 mM TEAA buffer pH 7.2 and drying 
as before. When using the 2-thiopyridyl derivatized compounds, anchoring can be monitored by the release of pyridine- 
2-thione spectrophotometrically at 343 nm. 

[0060] In another variation of this approach, the oligonucleotide primer is functionalized with an amino group at the 
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5'-end which is introduced by standard procedures during automated DNA synthesis. After primer extension, during 
the Sanger sequencing process, the primary amino group is reacted with 3-(2-pyridyldithio) propionic acid N-hydrox- 
ysuccinimide ester (SPDP) and subsequently coupled to the thiolated supports and monitored by the release of pyridyl- 
2-thione as described above. After denaturation of DNA polymerase and ethanol precipitation of the sequencing prod- 

5 ucts, the supernatants are removed and the pellets dissolved in 10 uMO mM TEAA buffer pH 7.2 and 1 0 ul of a 2 mM 
solution of SPDP in 10mM TEAA are added. The reaction mixture is vortexedand incubated for 30 min at25°C. Excess 
SPDP is then removed by three extractions (vortexing, centrifugation) with 50 ul each of ethanol and the resulting 
pellets are dissolved in 10 ul 10 mM TEAA buffer pH 7.2 and coupled to the thiolated supports (see above). 
[0061] The primer-extension products are purified by washing the membrane disks three times each with 100 ul 

10 NMM solution and three times with 1 00 ul each of 1 0 mM TEAA buffer pH 7.2. The purified primer-extension products 
are released by three successive treatments with 10 ul of 10 mM 2-mercaptoethanol in 1 0 mM TEAA buffer pH 7.2, 
lyophilized and analyzed by either ES or MALDI mass spectrometry. 

[0062] This procedure can also be used for the mass-modified nucleic acid primers UP 0 "' in an analogous and ap- 
propriate way, taking into account the chemical properties of the mass-modifying functionalities. 

15 

EXAMPLE 2 

Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric 
analysis via the levulinyl group 

[0063] 5-Aminolevulinic acid is protected at the primary amino group with the Fmoc group using 9-fluorenylmethyl 
N-succinimidyl carbonate and is then transformed into the N-hydroxysuccinimide ester (NHS ester) using N-hydroxy- 
succinimide and dicyclohexyl carbodiimide under standard conditions. For the Sanger sequencing reactions, nucleic 
acid primers, UP 0 " 1 , are used which arefunctionalized with a primary amino group at the 5'-end introduced by standard 

25 procedures during automated DNA synthesis with aminolinker phosphoamidites as the final synthetic step. Sanger 
sequencing is performed under standard conditions (see above). The four reaction mixtures (1 50 ul each in an Eppen- 
dorf tube) are heated to 70°C for 10 min to inactivate the DNA polymerase, ethanol precipitated, centrifuged and 
resuspended in 1 0 ul of 1 0 mM TEAA buffer pH 7.2. 1 0 ul of a 2 mM solution of the Fmoc-5-aminolevulinyt-NHS ester 
in 10 mM TEAA buffer is added, vortexed and incubated at 25°C for 30 min. The excess of the reagent is removed by 

30 ethanol precipitation and centrifugation. The Fmoc group is cleaved off by resuspending the pellets in 10 ul of a solution 
of 20% piperidine in N,N-dimethylformamide/water (1:1 v/v). After 15 min at 25°C, piperidine is thoroughly removed 
by three precipitations/centrifugations with 100 ul each of ethanol, the pellets are resuspended in 10 ul of a solution of 
N-methylmorpholine, 2-propanol and water (2/10/88 v/v/v) and are coupled to the solid support carrying an isothiocy- 
anate group. In the case of the DITC-Sequelon membrane (MilliporeCorp., Bedford, MA), the membranes are prepared 

35 as described in EXAMPLE 1 and coupling is achieved on a heating block at 55°C as described above. RNA extension 
products are immobilized in an analogous way. The procedure can be applied to other solid supports with isothiocyanate 
groups in a similar manner. 

[0064] The immobilized primer-extension products are extensively washed three times with 100 ul each of NMM 
solution and three times with 100 ul 10 mM TEAA buffer pH 7.2. The purified primer-extension products are released 
40 by three successive treatments with 10 ul of 100 mM hydrazinium acetate buffer pH 6.5, lyophilized and analyzed by 
either ES or MALDI mass spectrometry. 

EXAMPLE 3 

45 Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometric 
analysis via a trypsin sensitive linkage 

[0065] Sequelon DITC membrane disks of 8 mm diameter (Millipore Corp., Bedford, MA) are wetted with 10 ul of 
NMM solution (N-methylmorpholine/propanaol-2/water; 2/49/49 v/v/v) and a linker arm introduced by reaction with 10 

50 ul of a 10 mM solution of 1 ,6-diaminohexane in NMM. The excess diamine is removed by three washing steps with 
100 ul of NMM solution. Using standard peptide synthesis protocols, two L-lysine residues are attached by two suc- 
cessive condensations with N-Fmoc-N-tBoc-L-lysine pentafluorophenylester, the terminal Fmoc group is removed with 
piperidine in NMM and the free a-amino group coupled to 1 ,4-phenylene diisothiocyanate (DITC). Excess DITC is 
removed by three washing steps with 100 ul 2-propanol and the N-tBoc groups removed with trifluoroacetic acid ac- 

55 cording to standard peptide synthesis procedures. The nucleic acid primer-extension products are prepared from oli- 
gonucleotides which carry a primary amino group at the 5'-terminus. The four Sanger DNA sequencing reaction mix- 
tures (150 u! each in Eppendorf tubes) are heated for 10 min at 70°C to inactivate the DNA polymerase, ethanol 
precipitated, and the pellets resuspended in 10 ul of a solution of N-methylmorpholine, 2-propanol and water (2/10/88 
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v/v/v). This solution is transferred to the Lys-Lys-DITC membrane disks and coupled on a heating block set at 55°C. 
After drying, 10 ul of NMM solution is added and the drying process repeated. 

[0066] The immobilized primer-extension products are extensively washed three times with 100 ul each of NMM 
solution and three times with 100 ul each of 10 mM TEAA buffer pH 7.2. For mass spectrometry analysis, the bond 
s between the primer-extension products and the solid support is cleaved by treatment with trypsin under standard con- 
ditions and the released products analyzed by either ES or MALDI mass spectrometry with trypsin serving as an internal 
mass standard. 

EXAMPLE 4 

10 

Immobilization of primer-extension products of Sanger DNA sequencing reaction for mass spectrometry 
analysis via pyrophosphate linkage 

[0067] The DITC Sequelon membrane (disks of 8 mm diameter) are prepared as described in EXAMPLE 3 and 10 
15 ul of a 10 mM solution of 3-aminopyridine adenine dinucleotide (APAD) (Sigma) in NMM solution added. The excess 
APAD is removed by a 10 ul wash of NMM solution and the disks are treated with 1 0 ul of 1 0 mM sodium periodate in 
NMM solution (15 min, 25°C). Excess periodate is removed and the primer-extension products of the four Sanger DNA 
sequencing reactions (150 ul each in Eppendorf tubes) employing nucleic acid primers with a primary amino group at 
the 5'-end are ethanol precipitated, dissolved in 10 ul of a solution of N-methylmorpholine/2-propanol/water (2/10/88 
20 v/v/v) and coupled to the 2' 3'-dialdehydo groups of the immobilized NAD analog. 

[0068] The primer-extension products are extensively washed with the NMM solution (3 times with 1 00 ul each) and 
10 mM TEAA buffer pH 7.2 (3 times with 100 ul each) and the purified primer-extension products are released by 
treatment with either NADase or pyrophosphatase in 1 0 mM TEAA buffer at pH 7.2 at 37°C for 1 5 min, lyophilized and 
analyzed by either ES or MALDI mass spectrometry, the enzymes serving as internal mass standards. 

25 

EXAMPLE 5 

Synthesis of nucleic acid primers mass-modified by glycine residues at the S'-position of the sugar moiety of 
the terminal nucleoside 

30 

[0069] Oligonucleotides are synthesized by standard automated DNA synthesis using f}-cyanoethylphosphoamidites 
(H. Koster et al., Nucleic Acids Res. 12, 4539 (1984)) and a 5'-amino group is introduced at the end of solid phase 
DNA synthesis (e.g. Agrawal et at., Nucleic Acids Res. 14, 6227-45 (1986) or Sproat et ai, Nucleic Acids Res. 15, 
61 81 -96 (1 987)). The total amount of an oligonucleotide synthesis, starting with 0.25 umol CPG-bound nucleoside, is 

35 deprotected with concentrated aqueous ammonia, purified via OligoPAK™ Cartridges (Millipore Corp., Bedford, MA) 
and lyophilized! This material with a 5'-terminal amino group is dissolved in 100 ul absolute N,N-dimethylformamide 
(DMF) and condensed with 10 prnole N-Fmoc-glycine pentafluorophenyl ester for 60 min at 25°C. After ethanol pre- 
cipitation and centrifugation, the Fmoc group is cleaved off by a 10 min treatment with 100 ul of a solution of 20% 
piperidine in N.N-dimethylformamide. Excess piperidine, DMF and the cleavage product from the Fmoc group are 

40 removed by ethanol precipitation and the precipitate lyophilized from 10 mM TEAA buffer pH 7.2. This material is now 
either used as primer for the Sanger DNA sequencing reactions or one or more glycine residues (or other suitable- 
protected amino acid active esters) are added to create a series of mass-modified primer oligonucleotides suitable for 
Sanger DNA or RNA sequencing. Immobilization of these mass-modified nucleic acid primers UP 0 "' after primer-ex- 
tension during the sequencing process can be achieved as described, e.g., in EXAMPLES 1 to 4. 

45 

EXAMPLE 6 

Synthesis of nucleic acid primers mass-modified at C-5 of the heterocyclic base of a pyrimidine nucleoside 
with glycine residues 

50 

[0070] Starting material was 5-{3-aminopropynyl-1 )-3' S'-di-p-tolyldeoxyurtdine prepared and 3* 5'-de : 0-acylated ac- 
cording to literature procedures (Haralambidis et ai, Nucleic Acids Res. 15 , 4857-76 (1987)). 0.281 g (1.0 mmole) 
5-(3-aminopropynyl-1)-2'-deoxyuridine were reacted with 0.927 g (2.0 mmole) N-Fmoc-glycine pentafluorophenylester 
in 5 ml absolute N.N-dimethylformamide in the presence of 0.129 g (1 mmole; 174 ul) N,N-diisopropylethylamine for 
55 60 min at room temperature. Solvents were removed by rotary evaporation and the product was purified by silica gel 
chromatography (Kieselgel 60, Merck; column: 2.5x 50 cm, etution with chloroform/methanol mixtures). Yield was 0.44 
g (0.78 mmole, 78 %). In order to add another glycine residue, the Fmoc group is removed with a 20 min treatment 
with 20% solution of piperidine in DMF, evaporated in vacuo and the remaining solid material extracted three times 
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with 20 ml ethytacetate. After having removed the remaining ethylacetate, N-Fmoc-glycine pentafluorophenylester is 
coupled as described above. 5-(3-(N-Fmoc-glycyl)-amidopropynyl-1)-2'-deoxyuridine is transformed into the 5'-0- 
dimethoxytritylated nucleoside-3'-0-p-cyanoethyl-N,N-diisopropylphosphoamidite and incorporated into automated ol- 
igonucleotide synthesis by standard procedures (H. Koster et at., Nucleic Acids Res. 12 , 2261 (1984)). This glycine 
5 modified thymidine analogue building block for chemical DNA synthesis can be used to substitute one or more of the 
thymidine/uridine nucleotides in the nucleic acid primer sequence. The Fmoc group is removed at the end of the solid 
phase synthesis with a 20 min treatment with a 20 % solution of piperidine in DM F at room temperature. DM F is removed 
by a washing step with acetonitrile and the oligonucleotide deprotected and purified in the standard way. 

10 EXAMPLE 7 

Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a pyrimidine nucleoside 
with p-alanine residues 

'5 [0071] Starting material was the same as in EXAMPLE 6. 0,281 g (1 .0 mmole) 5-(3-Aminopropynyl-1 )-2'-deoxyuri- 
dine was reacted with N-Fmoc-p-alanine pentafluorophenylester (0.955 g, 2.0 mmole) in 5 ml N,N-dimethylformamide 
(DMF) in the presence of 0.129 g (174 ul; 1.0 mmole) N,N-disopropylethylamine for 60 min at room temperature. 
Solvents were removed and the product purified by silica gel chromatography as described in EXAMPLE 6. Yield was 
0.425 g (0.74 mmole, 74 %). Another p-alanine moiety can be added in exactly the same way after removal of the 

20 Fmoc group. The preparation of the 5'-0-dimethoxytritylated nucleoside-3'-0-p-cyanoethyl-N,N-diisopropylphospho- 
amidite from 5-(3-(N-Fmoc-(S-alanyl)-amidopropynyl-1)-2'-deoxyuridine and incorporation into automated oligonucle- 
otide synthesis is performed under standard conditions. This building block can substitute for any of the thymidine/ 
uridine residues in the nucleic acid primer sequence. In the case of only one incorporated mass-modified nucleotide, 
the nucleic acid primer molecules prepared according to EXAMPLES 6 and 7 would have a mass difference of 14 

25 daltons. 

EXAMPLE 8 

Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a pyrimidine nucleoside 
30 with ethylene glycol monomethyl ether 

[0072] As a nucleosidic component, 5-(3-aminopropynyl-1)-2'-deoxyuridine was used in this example (see EXAM- 
PLES 6 and 7). The mass-modifying functionality was obtained as follows: 7.61 g (100.0 mmole) freshly distilled eth- 
ylene glycol monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) recrys- 

35 tallized succinic anhydride in the presence of 1.22 g (10.0 mmole) 4-N,N-dimethylaminopyridine overnight at room 
temperature. The reaction was terminated by the addition of water (5.0 ml), the reaction mixture evaporated in vacuo, 
co-evaporated twice with dry toluene (20 ml each) and the residue redissolved in 1 00 ml dichloromethane. The solution 
was extracted successively, twice with 1 0 % aqueous citric acid (2 x 20 ml) and once with water (20 ml) and the organic 
phase dried over anhydrous sodium sulfate. The organic phase was evaporated in vacuo, the residue redissolved in 

40 50 ml dichloromethane and precipitated into 500 ml pentane and the precipitate dried in vacuo. Yield was 1 3. 1 2 g (74.0 
mmole; 74 %). 8.86 g (50.0 mmole) of succinylated ethylene glycol monomethyl ether was dissolved in 1 00 ml dioxane 
containing 5% dry pyridine (5 mi) and 6.96 g (50.0 mmole) 4-nitrophenol and 10.32 g (50.0 mmole) dicyclohexylcar- 
bodiimide was added and the reaction run at room temperature for 4 hours. Dicyclohexylurea was removed by filtration, 
the filtrate evaporated in vacuo and the residue redissolved in 50 ml anhydrous DMF. 12.5 ml (about 12.5 mmole 

45 4-nitrophenylester) of this solution was used to dissolve 2.81 g (10.0 mmole) 5-(3-aminopropynyl-1)-2 1 -deoxyuridine. 
The reaction was performed in the presence of 1 .01 g (10.0 mmole; 1 .4 ml) triethylamine at room temperature overnight. 
The reaction mixture was evaporated in vacuo, co-evaporated with toluene, redissolved in dichloromethane and chro- 
matographed on silicagel (Si60, Merck; column 4x50 cm) with dichloromethane/methanol mixtures. The fractions con- 
taining the desired compound were collected, evaporated, redissolved in 25 ml dichloromethane and precipitated into 

50 250 ml pentane. The dried precipitate of 5-(3-N-(0-succinyl ethylene glycol monomethyl ether)-amidopropynyl-1)-2'- 
deoxyuridine (yield: 65 %) is S'-O-dimethoxytritylated and transformed into the nucleoside-S'-O-p-cyanoethyl-N.N-di- 
isopropylphosphoamidite and incorporated as a building block in the automated oligonucleotide synthesis according 
to standard procedures. The mass-modified nucleotide can substitute for one or more of the thymidine/uridine residues 
in the nucleic acid primer sequence. Deprotection and purification of the primer oligonucleotide also follows standard 

55 procedures. 
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EXAMPLE 9 

Synthesis of a nucleic acid primer mass-modified at C-5 of the heterocyclic base of a pyrimldine nucleoside 
with diethyiene glycol monomethyl ether 

5 

[0073] Nucleosidic starting material was as in previous examples, 5-(3-aminopropynyl-1 )-2'-deoxyuridine. The mass- 
modifying functionality was obtained similar to EXAMPLE 8. 12.02 g (100.0 mmole) freshly distilled diethyiene glycol 
monomethyl ether dissolved in 50 ml absolute pyridine was reacted with 10.01 g (100.0 mmole) recrystallized succinic 
anhydride in the presence of 1 .22 g (10.0 mmole) 4-N, N-dimethylaminopyridine (DMAP) overnight at room tempera- 

10 ture. The work-up was as described in EXAMPLE 8. Yield was 18.35 g (82.3 mmole, 82.3 %). 11 ,06 g (50.0 mmole) 
- of succinylated diethyiene glycol monomethyl ether was transformed into the 4-nitrophenylester and, subsequently, 
12.5 mmole was reacted with 2.81 g (10.0 mmole) of 5-(3-aminopropynyl-1)-2'-deoxyuridine as described in EXAMPLE 
8. Yield after silica gel column chromatography and precipitation into pentane was 3.34 g (6.9 mmole, 69 %). After 
dimethoxytritylation and transformation into the nucleoside-p-cyanoethylphosphoamidite, the mass-modified building 

is block is incorporated into automated chemical DNA synthesis according to standard procedures. Within the sequence 
of the nucleic acid primer UP 0 -', one or more of the thymidine/uridine residues can be substituted by this mass-modified 
nucleotide. In the case of only one incorporated mass-modified nucleotide, the nucleic acid primers of EXAMPLES 8 
and 9 would have a mass difference of 44.05 daftons. 

20 EXAMPLE 10 

Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of deoxyadenosine with 
glycine 

25 [0074] Starting material was N 6 -benzoyl-8-bromo-5'-0-(4,4'-dimethoxytrityl)-2'-deoxyadenosine prepared according 
to literature (Singh et a/., Nucleic Acids Res. 18, 3339-45 (1990)). 632.5 mg (1.0 mmole) of this 8-bromo-deoxyade- 
nosine derivative was suspended in 5 ml absolute ethanol and reacted with 251 .2 mg (2.0 mmole) glycine methyl ester 
(hydrochloride) in the presence of 241.4 mg (2.1 mmole; 366 ul) N, N-diisopropylethylamine and refluxed until the 
starting nucleosidic material had disappeared (4-6 hours) as checked by thin layer chromatography (TLC). The solvent 

30 was evaporated and the residue purified by silica gel chromatography (column 2.5x50 cm) using solvent mixtures of 
chloroform/methanol containing 0.1 % pyridine. The product fractions were combined, the solvent evaporated, the 
fractions dissolved in 5 ml dichloromethane and precipitated into 100 ml pentane. Yield was 487 mg (0,76 mmole, 76 
%). Transformation into the corresponding nucleoside-p-cyanoethylphosphoamidite and integration into automated 
chemical DNA synthesis is performed under standard conditions. During final deprotection with aqueous concentrated 

35 ammonia, the methyl group is removed from the glycine moiety. The mass-modified building block can substitute one 
or more deoxyadenosine/adenosine residues in the nucleic acid primer sequence. 

EXAMPLE 11 

40 Synthesis of a nucleic acid primer mass-modified at C-8 of the heterocyclic base of deoxyadenosine with 
glycylglycine 

[0075] This derivative was prepared in analogy to the glycine derivative of EXAMPLE 1 0. 632.5 mg (1.0 mmole) N 6 - 
Benzoyl-8-bromo-5 t -0-(4,4'-dimethoxytrityl)-2'-deoxyadenosine was suspended in 5 ml absolute ethanol and reacted 

45 with 324.3 mg (2.0 mmole) glycyl-glycine methyl ester in the presence of 241 .4 mg (2.1 mmole, 366 uJ) N, N-diisopro- 
pylethylamine. The mixture was refluxed and completeness of the reaction checked by TLC. Work-up and purification 
was similar to that described in EXAMPLE 10. Yield after silica gel column chromatography and precipitation into 
pentane was 464 mg (0.65 mmole, 65 %). Transformation into the nucleoside-p-cyanoethylphosphoamidite and into 
synthetic oligonucleotides is done according to standard procedures. In the case where only one of the deoxyadeno- 

50 -sine/adenosine residues in the nucleic acid primer is substituted by this mass-modified nucleotide, the mass difference 
between the nucleic acid primers of EXAMPLES 10 and 11 is 57.03 daltons. 

EXAMPLE 12 

55 Synthesis of a nucleic acid primer mass-modified at the C-2' of the sugar moiety of 2'-amino-2'-deoxythymidine 
with ethylene glycol monomethyl ether residues 

[0076] Starting material was 5'-0-(4,4-dimethoxytrityl)-2 , -amino-2'-deoxythymidine synthesized according to pub- 
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lished procedures (e.g., Verheyden era/., J. Org. Chem. 36, 250-254 (1971); Sasaki era/., J. Org. Chem . 41 , 3138-3143 
(1976); Imazawa etai, J. Org. Chem. 44, 2039-2041 (1979); Hobbs etaf., J. Org. Chem. 42, 714-719 (1976); Ikehara 
etal., Chem. Pharm. Bull. Japan 26, 240-244 (1978); see also PCT Application WO 88/00201). 5"-0-(4 f 4-Dimethox- 
ytrityl)-2'-amino-2'-deoxythymidine (559.62 mg; 1.0 mmole) was reacted with 2.0 mmoie of the 4-nitrophenyl ester of 

5 succinylated ethylene glycol monomethyl ether (see EXAMPLE 8) in 1 0 ml dry DMF in the presence of 1 .0 mmole (1 40 
triethylamine for 1 8 hours at room temperature. The reaction mixture was evaporated in vacuo, co-evaporated with 
toluene, redissolved in dichloromethane and purified by silica gel chromatography (Si60, Merck; column: 2.5x50 cm; 
eluent: chloroform/methanol mixtures containing 0.1 % triethylamine). The product containing fractions were combined, 
evaporated and precipitated into pentane. Yield was 524 mg (0.73 mmol; 73 %). Transformation into the nucleoside- 

10 p-cyanoethyl-N,N-diisopropylphosphoamidite and incorporation into the automated chemical DNA synthesis protocol 
is performed by standard procedures. The mass-modftied deoxythymidine derivative can substitute for one or more of 
the thymidine residues in the nucleic acid primer. 

[0077] In an analogous way, by employing the 4-nitrophenyl ester of succinylated diethylene glycol monomethyl ether 
(see EXAMPLE 9) and triethylene glycol monomethyl ether, the corresponding mass-modified oligonucleotides are 
'5 prepared. In the case of only one incorporated mass-modified nucleoside within the sequence, the mass difference 
between the ethylene, diethylene and triethylene glycol derivatives is 44.05, 88.1 and 132.15 daltons respectively. 

EXAMPLE 13 

20 Synthesis of a nucleic acid primer mass-modified in the internucieotidic linkage via alkyiation of 
phosphorothioate groups 

[0078} Phosphorothioate-containihg oligonucleotides were prepared according to standard procedures (see e.g. Gait 
et a/., Nucleic Acids Res. , 19 1183 (1991)). One, several or all internucleottde linkages can be modified in this way. 

25 The (-)-M13 nucleic acid primer sequence (17-mer) 5'-dGTAAAACGACGGCCAGT was synthesized in 0.25 ujnole 
scale on a DNA synthesizer and one phosphorothioate group introduced after the final synthesis cycle (G to T coupling). 
Sulfurization, deprotection and purification followed standard protocols. Yield was 31 .4 nmole (12.6 % overall yield), 
corresponding to 31 .4 nmole phosphorothioate groups. Alkyiation was performed by dissolving the residue in 31 .4 p.! 
TE buffer (0.01 M Tris pH 8.0, 0.001 M EDTA) and by adding 16 uJ of a solution of 20 mM solution of 2-iodoethanol 

30 (320 nmole; i.e., 10-fold excess with respect to phosphorothioate diesters) in N.N-dimethylformamide (DMF). The 
alkylated oligonucleotide was purified by standard reversed phase HPLC (RP-1 8 Ultraphere, Beckman; column: 4.5 x 
250 mm; 100 mM triethylammonium acetate, pH 7.0 and a gradient of 5 to 40 % acetonitrile). 

[0079] In a variation of this procedure, the nucleic acid primer containing one or more phosphorothioate phosphodi- 
ester bond is used in the Sanger sequencing reactions. The primer-extension products of the four sequencing reactions 
35 are purified as exemplified in EXAMPLES 1 - 4, cleaved off the solid support, lyophilized and dissolved in 4 u.l each of 
TE buffer pH 8.0 and alkylated by addition of 2 uJ of a 20 mM solution of 2-iodoethanol in DMF. It is then analyzed by 
ES and/or MALDl mass spectrometry. 

[0080] In an analogous way, employing instead of 2-iodoethanol, e.g., 3-iodopropanol, 4-iodobutanol mass-modified 
nucleic acid primer are obtained with a mass difference of 14.03, 28.06 and 42.03 daltons respectively compared to 
*o the unmodified phosphorothioate phosphodiester-containing oligonucleotide. 

EXAMPLE 14 

Synthesis of 2'-amino-2'-deoxyuridine-5'-triphosphate and S'-amino^'^'-dideoxythymidine-S'-triphosphate 
45 mass-modified at the 2'- or 3'-amino function with glycine or ^-alanine residues 

[0081] Starting material was 2'-azido-2'-deoxyuridine prepared according to literature (Verheyden et at, J. Org, 
Chem , 36, 250 (1971)), which was 4,4-dimethoxytritylated at 5-OH with 4,4-dimethoxytrityl chloride in pyridine and 
acetylated at 3'-OH with acetic anhydride in a one-pot reaction using standard reaction conditions. With 191 mg (0.71 

so mmole) 2'-azido-2'-deoxyuridine as starting material, 396 mg (0.65 mmol, 90.8 %) 5'-0-(4,4-dimethoxytrrtyl)-3'-0- 
acetyl-2'-azido-2'-deoxuridine was obtained after purification via silica gel chromatography. Reduction of the azido 
group was performed using published conditions (Barta et at., Tetrahedron 46 , 587-594 (1990)). Yield of 5'- 
O^^-dimethoxytrityO-S'-O-acetyl^'-amino^'-deoxyuridine after silica gel chromatography was 288 mg (0.49 mmole; 
76 %). This protected 2'-amino-2'-deoxyuridine derivative (588 mg, 1 .0 mmole) was reacted with 2 equivalents (927 

55 mg, 2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester in 1 0 ml dry DMF overnight at room temperature in the pres- 
ence of 1 .0 mmole (1 74 uJ) N,N-diisopropylethylamine. Solvents were removed by evaporation in vacuo and the residue 
purified by silica gel chromatography. Yield was 711 mg (0.71 mmole, B2 %). Detritylation was achieved by a one hour 
treatment with 80% aqueous acetic acid at room temperature. The residue was evaporated to dryness, co-evaporated 
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twice with toluene, suspended in 1 ml dry acetonitrile and 5'-phosphorylated with POCI 3 according to literature 
(Yoshikawa era/., Bull. Chem. Soc. Japan 42, 3505 (1969) and Sowa era/,, Bull. Chem. Soc. Japan 48, 2084 (1975)) 
and directly transformed in a one-pot reaction to the 5'-triphosphate using 3 ml of a 0.5 M solution (1 .5 mmole) tetra 
(tri-n-butylammonium) pyrophosphate in DMF according to literature (e.g. Seela et al., Helvetica Chimica Acta 74, 

s 1 048 (1 991 )). The Fmoc and the 3'-0-acetyl groups were removed by a one-hour treatment with concentrated aqueous 
ammonia at room temperature and the reaction mixture evaporated and lyophilized. Purification also followed standard 
procedures by using anion-exchange chromatography on DEAE-Sephadex with a linear gradient of triethyiammonium 
bicarbonate (0.1 M - 1.0 M). Triphosphate containing fractions (checked by thin layer chromatography on polyethyle- 
neimine cellulose plates) were collected, evaporated and lyophilized. Yield (by UV-absorbance of the uracil moiety) 

10 was 6B% (0.48 mmole). 

[0082] A glycyl-glycine modified 2'-amino-2'-deoxyuridine-5'-triphosphate was obtained by removing the Fmoc group 
from 5'0-(4,4-dimethoxytrityl)-3'-0-acetyl-2 , -N-(N-9-fluorenylmetnyloxycarbonyl-glycyl)-2 , -ami by 
a one-hour treatment with a 20% solution of piperidine in DMF at room temperature, evaporation of solvents, two-fold 
co-evaporation with toluene and subsequent condensation with N-Fmoc-glycine pentafluorophenyl ester. Starting with 

*5 1 .0 mmole of the 2 , -N-glycyl-2'-amino-2 l -deoxyuridine derivative and following the procedure described above, 0.72 
mmole (72%) of the corresponding 2 , -(N-glycyl-glycyl)-2'-amino-2'-deoxyuridine-5 , -triphosphate was obtained. 
[0083] Starting with 5 , -0-(4,4-dimethoxytrityl)-3'-0-acetyl-2 , -amino-2 l -deoxyuridine and coupling with N-Fmoc-p- 
alanine pentafluorophenyl ester, the corresponding 2'-(N-p-alanyl)-2'-amino-2 , -deoxyuridine-5'-triphosphate can be 
synthesized. These modified nucleoside triphosphates are incorporated during the Sanger DNA sequencing process 

20 in the primer-extension products. The mass difference between the glycine, p-alanine and glycyl-glycine mass-modified 
nucleosides is, per nucleotide incorporated, 58.06, 72.09 and 115.1 daltons respectively. 

[0084] When starting with 5 , -0-(4 ) 4-dimethoxytrityl)-3'-amino-2',3 , -dideoxythymidine (obtained by published proce- 
dures, see EXAMPLE 12), the corresponding S'-CN-glycylJ-S'-amino-/ 3'-(-N-glycyl-glycy])-3'-amino-/ and 3'-(N-(J-ala- 
f nyl)-3'-amino-2\3'-dideoxythymidine-5'-triphosphates can be obtained. These mass-modified nucleoside triphos- 
25 phates serve as a terminating nucleotide unit in the Sanger DNA sequencing reactions providing a mass difference 
per terminated fragment of 58.06, 72.09 and 115.1 daltons respectively when used in the multiplexing sequencing 
mode. The mass-differentiated fragments can then be analyzed by ES and/or MALDI mass spectrometry. 

EXAMPLE 15 

30 

Synthesis of deoxyuridine-5'-triphosphate mass-modified at C-5 of the heterocyclic base with glycine, glycyl- 
glycine and p-alanine residues. 

[0085] 0,281 g (1 .0 mmole) 5-(3-Aminopropynyl-1 )-2'-deoxyuridine (see EXAMPLE 6) was reacted with either 0.927 

35 g (2.0 mmole) N-Fmoc-glycine pentafluorophenylester or 0.955g (2.0 mmole) N-Fmoc-p-alanine pentafluorophenyl 
ester in 5 ml dry DMF in the presence of 0.129 g N t N-diisopropylethylamine (174 ul, 1.0 mmole) overnight at room 
temperature. Solvents were removed by evaporation in vacuo and the condensation products purified by flash chro- 
matography on silica gel (Still et ai, J. Org. Chem. 43 , 2923-2925 (1 978)). Yields were 476 mg (0.85 mmole: 85%) for 
the glycine and 436 mg (0.76 mmole; 76%) for the p-alanine derivatives. For the synthesis of the glycyl-glycine deriv- 

40 ative, the Fmoc group of 1 .0 mmole Fmoc-glycine-deoxyuridine derivative was removed by one-hour treatment with 
20% piperidine in DMF at room temperature. Solvents were removed by evaporation in vacuo, the residue was co- 
evaporated twice with toluene and condensed with 0.927 g (2.0 mmole) N-Fmoc-glycine pentafluorophenyl ester and 
purified as described above. Yield was 445 mg (0.72 mmole; 72%). The glycyl-, glycyl-glycyl- and p-alanyl-2'-deoxy- 
uridine derivatives, N-protected with the Fmoc group were transformed to the 3'-0-acetyl derivatives by tritylation with 

45 4,4-dimethoxytrityl chloride in pyridine and acetylation with acetic anhydride in pyridine in a one-pot reaction and sub- 
sequently detritylated by one hour treatment with 80% aqueous acetic acid according to standard procedures. Solvents 
were removed, the residues dissolved in 100 ml chloroform and extracted twice with 50 ml 10% sodium bicarbonate 
and once with 50 ml water, dried with sodium sulfate, the solvent evaporated and the residues purified by flash chro- 
matography on silica gel. Yields were 361 mg (0.60 mmole; 71%) for the glycyl-, 351 mg (0.57 mmole; 75%) for the p- 

50 alanyl- and 323 mg (0.49 mmole; 68%) for the glycyl-glycyl-3-0'-acetyl-2'-deoxyuridine derivatives respectively. Phos- 
phorylation at the 5'-OH with POCI 3 , transformation into the 5'-triphosphate by in-situ reaction with tetra(tri-n-butylam- 
monium) pyrophosphate in DMF, 3'-de-0-acetylation, cleavage of the Fmoc group, and final purification by anion- 
exchange chromatography on DEAE-Sephadex was performed as described in EXAMPLE 14. Yields according to UV- 
absorbance of the uracil moiety were 0.41 mmole 5-(3-(N-glycyl)-amidopropynyl-1)-2'-deoxyuridine-5Mriphosphate 

55 (84%), 0.43 mmole 5-(3-(N-p-alanyl)-amidopropynyl-1)-2 , -deoxyuridine-5'-triphosphate (75%) and 0.38 mmole 
5-(3-(N-glycyl-glycyl)-amidopropynyl-1 )-2'-deoxyuridine-5'-triphosphate (78%). 

[0086] These mass-modified nucleoside triphosphates were incorporated during the Sanger DNA sequencing primer- 
extension reactions. 
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[0087] When using 5-(3-aminopropynyl-1)-2\3'-dideoxy uridine as starting material and following an analogous re- 
action sequence the corresponding glycyl-, glycyl-glycyl-and fJ-alanyl^'.S'-dideoxyuridine-S'-triphosphates were ob- 
tained in yields of 69, 63 and 71% respectively. These mass-modified nucleoside triphosphates serve as chain-termi- 
nating nucleotides during the Sanger DNA sequencing reactions. The mass-modified sequencing ladders are analyzed 
s by either ES or MALDl mass spectrometry. 

EXAMPLE 16 

Synthesis of 8-glycyl- and 8-glycyl-glycyl-2"-deoxyadenosine-5'-triphosphate 

10 

[0088] 727 mg (1 .0 mmole) of N 6 -(4-tert-butylphenoxyacetyl)-8-glycyl-5'-(4,4-dimethoxytrityl)-2'- deoxyadenosine or 
800 mg (1.0 mmole) N 6 -(4-tert-butylphenoxyacetyl)-8-glycyt-glycyl-5'-(4 t 4-dimethoxytrityl)-2'-deoxyadenosine pre- 
pared according to EXAMPLES 10 and 11 and literature (Koster et af. t Tetrahedron 37, 362 (1981)) were acetylated 
with acetic anhydride in pyridine at the 3'-OH, detritylated at the S'-position with 80% acetic acid in a one-pot reaction 

is and transformed into the S'-triphosphates via phosphorylation with POCI 3 and reaction in-situ with tetra(tri : n-butylam- 
monium) pyrophosphate as described in EXAMPLE 14. Deprotection of the N 6 -tert-butylphenoxyacetyl, the 3'-0-acetyl 
and the O-methyl group at the glycine residues was achieved with concentrated aqueous ammonia for ninety minutes 
at room temperature. Ammonia was removed by lyophilization and the residue washed with dichloromethane, solvent 
removed by evaporation in vacuo and the remaining solid material purified by anion-exchange chromatography on 

20 DEAE-Sephadex using a linear gradient of triethylammonium bicarbonate from 0.1 to 1 .0 M. The nucleoside triphos- 
phate containing fractions (checked by TLC on polyethyleneimine cellulose plates) were combined and lyophillized. 
Yield of the 8-glycyl-2'-deoxyadenosine-5'-triphosphate (determined by UV-absorbance of the adenine moiety) was 
57% (0.57 mmole). The yield for the 8-glycyl-glycyl-2'-deoxyadenosine-5'-triphosphate was 51% (0.51 mmole). 
[0089] These mass-modified nucleoside triphosphates were incorporated during primer-extension in the Sanger DNA 

25 sequencing reactions. 

[0090] When using the corresponding N6-(4-tert-butylphenoxyacetyl)-8-glycyl- or -glycyl-glycyl-5'-0-(4,4-dimethox- 
ytrityl^'.S'-dideoxyadenosine derivatives as starting materials prepared according to standard procedures (see, e.g., 
for the introduction of the 2|,3 , -f unction: Seela et a/., Helvetica Chimica Acta 74, 1048-1058 (1991)) and using an 
analogous reaction sequence as described above, the chain-terminating mass-modified nucleoside triphosphates 
30 8-glycyl- and 8-glycyl-glycyl-2\3'-dideoxyadenosine-5'-triphosphates were obtained in 53 and 47% yields respectively. 
The mass-modified sequencing fragment ladders are analyzed by either ES or MALDl mass spectrometry. 

EXAMPLE 17 

35 Mass-modification of Sanger DNA sequencing fragment ladders by incorporation of chain-elongating 2'-deoxy- 
and chain-terminating 2 , ,3 , -dideoxythymidine-5'-(alpha-S-)-triphosphate and subsequent alkylation with 
2-iodoethanol and 3-iodopropanot 

[0091] 2',3 , -Dideoxythymidine-5 , -(alpha-S)-triphosphate was prepared according to published procedures (e.g., for 
40 the alpha-S-triphosphate moiety: Eckstein et ai, Biochemistry 15 , 1685 (1976) and Accounts Chem. Res. 12 , 204 
(1978) and for the 2',3'-dideoxy moiety: Seela era/., Helvetica Chimica Acta , 74, 1048-1058 (1991)). Sanger DNA 
sequencing reactions employing 2'-deoxythymidine-5'-(a1pha-S)-triphosphate are performed according to standard 
protocols (e.g. Eckstein, Ann. Rev. Biochem. 54 , 367 (1 985)). When using 2\3'-dideoxythymidine-5'-(a[pha-S)-triphos- 
phates, this is used instead of the unmodified 2\3 l -dideoxythymidine-5'-triphosphate in standard Sanger DNA sequenc- 
es mg (see e.g. Swerdlow et at, Nucleic Acids Res. 18 , 1415-1419 (1990)). The template (2 pmole) and the nucleic acid 
M1 3 sequencing primer (4 pmole) modified according to EXAMPLE 1 are annealed by heating to 65°C in 100 ul of 10 
mM Tris-HCI pH 7.5,10 mM MgCI 2 , 50 mM NaCI, 7 mM dithiothreitol (DTT) for 5 min and slowly brought to 37°C during 
a one hour period. The sequencing reaction mixtures contain, as exemplified for the TTSpecific termination reaction, in 
a final volume of 150 ul, 200 uM (final concentration) each of dATP, dCTP, dTTP, 300 uM c7-deaza-dGTP, 5 uM 2',3'- 
50 dideoxythymidine-5'-(alpha-S)-triphosphate and 40 units Sequenase (United States Biochemicals). Polymerization is 
performed for 10 min at 37°C, the reaction mixture heated to 70°C to inactivate the Sequenase, ethanol precipitated 
and coupled to thiolated Sequelon membrane disks (8 mm diameter) as described in EXAMPLE 1 . Alkylation is per- 
formed by treating the disks with 10 ul of 10 mM solution of either 2-iodoethanol or 3-iodopropanol in NMM (N-meth- 
ylmorpholine/water/2-propanol, 2/49/49, v/v/v) (three times), washing with 10 ul NMM (three times) and cleaving the 
55 alkylated T-terminated primer-extension products off the support by treatment with DTT as described in EXAMPLE 1 . 
Analysis of the mass-modified fragment families is performed with either ES or MALDl mass spectrometry. 
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EXAMPLE 18 

Analysis of a Mixture of Oligothymldylic Acids 

s [0092] Oligothymidylic acid, oligo p(dT) 12 -i8' ' s commercially available (United States Biochemical, Cleveland, OH). 
Generally, a matrix solution of 0.5 M in ethanol was prepared. Various matrices were used for this Example and Ex- 
amples 19- 21 such as 3,5-dihydroxybenzoic acid, sinapinic acid, 3-hydroxypicolinic acid, 2,4,6-trihydroxyacetophe- 
none. Oligonucleotides were lyophilized after purification by HPLC and taken up in ultrapure water (MilliQ, Millipore) 
using amounts to obtain a concentration of 1 0 pmoles/u.l as stock solution. An aliquot (1 u,t) of this concentration or a 

10 dilution in ultrapure water was mixed with 1 uJ of the matrix solution on a flat metal surface serving as the probe tip 
and dried with a fan using cold air. In some experiments, cation-ion exchange beads in the acid form were added to 
the mixture of matrix and sample solution. 

[0093] M ALDI-TOF spectra were obtained for this Example and Examples 1 9-21 on different commercial instruments 
such as Vision 2000 (Finnigan-MAT), VG TofSpec (Fisons Instruments), LaserTec Research (Vestec). The conditions 

*5 for this Example were linear negative ion mode with an acceleration voltage of 25 kV. The M ALDI-TOF spectrum 
generated is shown in FIGURE 14. Mass calibration was done externally and generally achieved by using defined 
peptides of appropriate mass range such as insulin, gramicidin S, trypsinogen, bovine serum albumen, and cytochrome 
C. All spectra were generated by employing a nitrogen laser with 5 nsec pulses at a wavelength of 337 nm. Laser 
energy varied between 1 0 6 and 10 7 W/cm 2 . To improve signal-to-noise ratio generally, the intensities of 10 to 30 laser 

20 shots were accumulated. 

EXAMPLE 19 

Mass Spectrometric Analysis of a 50-mer and a 99-mer 

25 

[0094] Two large oligonucleotides were analyzed by mass spectrometry. The 50-mer d 
(TAACGGTCATTACGGCCATTGACTGTAGGACCTGCATTACATGACTAGCT) (SEQ ID NO:3) and dT(pdT) 99 were 
used. The oligodeoxynucleotides were synthesized using p -cyanoethylphosphoamidites and purified using published 
procedures. (e.g. N.D. Sinha, J. Biernat, J. McManus and H. Koster, Nucleic Acids Res. , 12 , 4539 (1984)) employing 
30 commercially available DNA synthesizers from either Millipore (Bedford, MA) or Applied Biosystems (Foster City, CA) 
and HPLC equipment and RP18 reverse phase columns from Waters (Milford, MA). The samples for mass spectro- 
metric analysis were prepared as described in Example 18. The conditions used for MALDI-MS analysis of each oli- 
gonucleotide were 500 fmol of each oligonucleotide, reflectron positive ion mode with an acceleration of 5 kV and 
postacceleration of 20 kV. The MALDI-TOF spectra generated were superimposed and are shown in FIGURE 15. 

35 

EXAMPLE 20 

Simulation of the DNA Sequencing Results of FIGURE 2 

40 [0095] The 13 DNA sequences representing the nested dT-terminated fragments of the Sanger DNA sequencing for 
the 50-mer described in Example 19 (SEQ ID.NO:3) were synthesized as described in Example 19. The samples were 
treated and 500 fmol of each fragment was analyzed by MALDI-MS as described in Example 18.. The resulting MAL- 
DI-TOF spectra are shown in FIGURES 16A-16M. The conditions were reflectron positive ion mode with an acceleration 
of 5 kV and postacceleration of 20 kV. Calculated molecular masses and experimental molecular masses are shown 

^5 in Table 1 . 

[0096] The MALDI-TOF spectra were superimposed (FIGURES 17A and 17B) to demonstrate that the individual 
peaks are resolvable even between the 10-mer and 11-mer (upper panel) and the 37-mer and 38-mer (lower panel). 
The two panels show two different scales and the spectra analyzed at that scale. 

50 EXAMPLE 21 

MALDI-MS Analysis of a Mass-Modified Oligonucleotide 

[0097] A 17-merwas mass-modified at C-5 of one or two deoxyuridine moieties. 5-[13-(2-Methoxyethoxyl)-tridecyne- 
55 1-yl]-5'-0-(4,4'-dimethoxytrityl)-2'-deoxyuridine-3'-p-cyanoethyl-N, N-diisopropylphosphoamidite was used to synthe- 
size the modified 17-mers using the methods described in Example 19. 
[0098] The modified 17-mers were 
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X 


a: d (TAAAACGACGGCCAGUG) 
(SEQIDNO:4) 


(molecular mass: 5454) 


5 


b: d (UAAAACGACGGCCAGUG) 
(SEQ IDNO:5) 


(molecular mass 5634) 


where X - -CsC-(CH 2 )i ]-OH 


15 


(unmodified 17-mer: molecular mass: 5273) 


[0099] The samples were prepared and 500 fmol of each modified 17-mer was analyzed using MALDI-MS as de- 
scribed in Example 1 8. The conditions used were reflectron positive ion mode with an acceleration of 5 kV and postac- 
celeration of 20 kV. The MALDI-TOF spectra which were generated were superimposed and are shown in FIGURE 18. 
20 [0100] All of the above-cited references and publications are hereby incorporated by reference. 

EQUIVALENTS 

[0101] Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, 
25 numerous equivalents to the specific procedures described herein. Such equivalents are considered to be within the 
scope of this invention and are covered by the following claims. 


35 


40 
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SEQUENCE LISTING 


(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: KOSTER, HUBERT 

(B) STREET: 1640 MONUMENT STREET 

(C) CITY: CONCORD 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

<F) POSTAL CODE (ZIP) : 01742 
(G) TELEPHONE: (508) 369-9790 

(ii) TITLE OF INVENTION: DNA SEQUENCING BY MASS SPECTROMETRY 

(iii) NUMBER OF SEQUENCES: 5 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCII (text) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 06-JAN-1994 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/001,323 

(B) FILING DATE: 07 -JAN- 1993 

(C) CLASSIFICATION: 1807 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A. 

<B) REGISTRATION NUMBER: 31,503 

(C) REFERENCE /DOCKET NUMBER : HKI-003CP 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 227-7400 

(B) TELEFAX: (617) 227-5941 


(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
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CATGCCATGG CATG 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL; YES 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
AAATTGTGCA CATCCTGCAG C 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL : YES 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TAACGGTCAT TACGGCCATT GACTGTAGGA CCTGCATTAC ATGACTAGCT 


(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS : 
* (A) LENGTH: 17 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: YES 


<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TAAAACGACG GGCCAGXG 
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(2) INFORMATION FOR SEQ ID NO : 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(iii) HYPOTHETICAL: YES 


20 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
XAAAACGACG GGCCAGXG 17 

Claims 

1 . A method for sequencing two or more nucleic acids, comprising: 

25 generating base-specifically terminated nucleic acid fragments from each of the nucleic acids to be sequenced; 

determining the molecular weight of each base-specifically terminated fragment by mass spectrometry; 
and determining the sequences of the nucleic acids by aligning the base-specifically terminated nucleic acid 
fragments according to molecular weight; 

30 wherein: 

the two or more nucleic acids are sequenced concurrently; and 

the base-specifically terminated nucleic acid fragments generated from one nucleic acid can be differentiated 
from the base-specifically terminated nucleic acid fragments generated from each of the other nucleic acids 
35 by molecular. weight. 

2. The method of claim 1 , wherein base-specifically terminated nucleic acid fragments generated from one or more 
of the nucleic acids are mass modified. 

40 3. The method of claim 2, wherein the mass-modified base-specifically terminated nucleic acid fragments are modified 
with a mass-modifying functionality (M) according to one or more of the following: 

(a) a mass-modifying functionality (M) that is at a heterocyclic base of at least one nucleotide; 

(b) a mass-modifying functionality (M) attached to the phosphate backbone; and 

45 (c) a mass-modifying functionality (M) attached to one or more sugar moieties in at least one sugar position 

selected from the group consisting of an internal C-2'position, an external C-2' position, and an external C-5' 
position. 

4. The method of claim 3, wherein the heterocyclic base-modified nucleotide is selected from the group consisting 
50 of a cytosine nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at 

the C-5 methyl group, a uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, ac 7 -deazaadenine 
modified at C-7, a guanine nucleotide modified at C-8, a c 7 -deazaguanine modified at C-8, a c 7 -deazaadenine 
modified at C-8, a c 7 -deazaguanine modified at C-7, a hypoxanthine modified at C-8, a c 7 -deazahypoxanthine 
modified at C-7, and a c 7 -deazahypoxanthine modified at C-8. 

55 

5. The method of claim 1 or 2, wherein each base-specifically terminated nucleic acid fragment is coupled by a linking 
group (L) to a functionality (!_') on a solid support. 
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6. The method of claim 5, wherein the coupling of each base-specifically terminated nucleic acid fragment to the solid 
support is reversible. 

7. The method of claim 5 or 6, wherein the mass-modified base-specifically terminated nucleic acid fragments are 
5 modified with a mass-modifying functionality (M) attached to the sugar moiety of a 5'-terminal nucleotide and where- 
in the mass-modifying function (M) is a linking functionality (L). 

8. The method of claim 6, wherein the base-specifically terminated nucleic acid fragments are cleaved from the solid 
support prior to or during mass spectrometry. 

10 

9. The method of claim 6, wherein the base-specifically terminated nucleic acid fragments are cleaved from the solid 
support enzymatically, chemically or physically. 

10. The method of claim 6, wherein the coupling of the base-specifically terminated nucleic acid fragments to the solid 
'5 support is selected from the group consisting of a photocleavable bond, a bond based on strong electrostatic 

interaction, a tritylether bond, a 0-benzoylpropionyl group, a levulinyl group, a disulfide bond, an arginine/arginine 
bond, a lysine/lysine bond, a pyrophosphate bond, and a bond created by Watson-Crick base pairing. 

1 1 . The method of claim 2, wherein the mass-modified base-specif icalty terminated nucleic acid fragments are modified 
20 with a mass-modifying functionality (M) which is attached to the base-specifically terminated nucleic acid fragments 

subsequent to generation of the base-specifically terminated fragments and prior to determining the molecular 
weight of the fragments by mass spectrometry. 

12. The method of 11 , wherein the generation of the base-specifically terminated fragments is performed by using at 
25 least one reagent selected from the group consisting of a nucleic acid primer, a chain-elongating nucleotide, a 

chain-terminating nucleotide and a tag probe which has been modified with a precursor of the mass-modifying 
functionality, M, and a subsequent step comprises modifying the precursor of the mass-modifying functionality, M, 
to generate the mass-modifying functionality, M, prior to mass spectrometric analysis. 

30 13. The method of claim 1 , wherein the base-specifically terminated nucleic acid fragments from each of the nucleic 
acids to be sequenced are generated by synthesizing nucleic acids complementary to the nucleic acids to be 
sequenced starting from a nucleic acid primer and in the presence of chain-terminating and chain-elongating nu- 
cleotides. 

35 1 4. The method of claim 13, wherein at least one of the nucleic acid primer, a chain-elongating nucleotide, and a chain- 
terminating nucleotide is mass-modified. 

15. The method of claim 13 or claim 14, wherein the primer is reversibly linked to a solid support through a linking 
group and the fragments are cleaved from the solid support by a laser during mass spectrometry. 

40 

16. The method of claim 1, further comprising after the step of determining the molecular weight of each base-specif- 
ically terminated fragment by mass spectrometry: 

hybridizing the base-specifically terminated nucleic acid fragments with one or more tag probes; 
45 determining the molecular weight of each of the base-specifically terminated nucleic acids; and 

comparing the molecular weights of the base-specifically terminated nucleic acids before and after hybridiza- 
tion to the tag probe(s); 

wherein base-specifically terminated nucleic acid fragments generated from one or more of the nucleic acids 
50 to be sequenced comprise a tag sequence which specifically hybridizes to a tag probe; and 

the tag probes are differentiated by molecular weight. 

1 7. The method of claim 1 6, wherein the tag probe(s) are covalently bound to the corresponding tag sequence(s) prior 
to mass spectrometric analysis. 


55 


18. The method of claim 17, wherein binding between the tag probe(s) and the corresponding tag sequence(s) is 
achieved photochemically via photoactivatable groups. 
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19. The method of claim 16, wherein mass differentiation of the tag probes is achieved by changing the nucleotide 
composition of at least one of the tag probes and complementary tag sequence in a base-specifically terminated 
nucleic acid. 

s 20. The method of claim 16, wherein mass differentiation of the tag probes is achieved by mass modification of one 
or more tag probes. 

21 . The method of claim 13, wherein at least one of the nucleic acid primer, a chain-elongating nucleotide, and a chain- 
terminating nucleotide comprises a modified nucleotide. 

10 

22. The method of claim 21 , wherein the modified nucleotide is a phosphorothioate nucleotide. 

23. The method of claim 22, wherein the phosphorothioate nucleotide is an alkylated phosphorothioate nucleotide. 

15 24. The method of claim 5, wherein the coupling of the base-specifically terminated nucleic acid fragments to the solid 
support is effected by a bond cleavable by a pyrophosphatase. 

25. The method of claim 5, further comprising purifying the base-specifically terminated nucleic acid fragments by 
washing out remaining reactants and by-products. 

20 

26. The method of claim 1 , wherein a counter-ion of the phosphate backbone of the base-specifically terminated nucleic 
acid fragments is removed or is exchanged with a second counter-ion. 

27. The method of claim 1 , wherein the molecular weight of each fragment is determined by matrix-assisted laser 
25 desorption/ionization mass spectrometry (MALDI-MS) or electrospray mass spectrometry (ES-MS). 

28. A kit for sequencing two or more species of nucleic acids by multiplex mass spectrometric nucleic acid sequencing, 
comprising: 

30 a) a solid support having a linking functionality (L*); 

b) nucleic acid primers suitable for initiating synthesis of a set of nucleic acids which are complementary to 
the different species of nucleic acids, the primers each including a linking group (L) able to interact with the 
linking functionality (U) and reversibly link the primers to the solid support and optionally, a tag probe; 

c) chain-elongating nucleotides for synthesizing the complementary nucleic acids; and 

35 d) chain-terminating nucleotides for terminating synthesis of the complementary nucleic acids and generating 

sets of base-specifically terminated complementary nucleic acid fragments, 

wherein in the absence of a tag probe, at least one reagent selected from the group consisting of the primers, 
the chain-elongating nucleotides, and the chain-terminating nucleotides is mass modified to provide distinction 
40 between each set of base-specifically terminated nucleic acid fragments of each species of nucleic acid by mass 

spectrometry. 

29. The kit of claim 28, wherein the chain-elongating nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (dATP) deoxythymidine triphosphate (dTTP), deoxyguanosine 

45 triphosphate (dGTP), deoxycytidine triphosphate (DCTP), deoxyinosine triphosphate (dlTP), 7-deaza deoxyade- 

nosine triphosphate (c 7 dATP), 7-deaza deoxythymidine triphosphate (C 7 TTP), 7-deaza deoxyguanosine triphos- 
phate (c 7 dGTP), 7-deaza deoxycytidine triphosphate (c 7 dCTP) and 7-deaza deoxyinosine triphosphate (c 7 dlTP). 

30. The kit of claim 28, wherein the chain-terminating nucleotides comprise at least one nucleotide selected from the 
50 group consisting of dideoxyadenosine triphosphate (ddATP), dideoxythymidine triphosphate (ddTTP), dideoxy- 

guanosine triphosphate (ddGTP), di deoxycytidine triphosphate (ddCTP), 7-deaza dideoxyguanosine triphosphate 
(c 7 ddGTP), 7-deaza dideoxyadenosine triphosphate (c 7 ddATP), 7-deaza dideoxyinosine triphosphate (c 7 ddlTP). 

31. The kit of claim 28, wherein the chain-elongating nucleotides comprise at least one nucleotide selected from the 
55 group consisting of adenosine triphosphate (ATP), uridine triphosphate (UTP), guanosine triphosphate (GTP), 

cytidine triphosphate (CTP), inosine triphosphate (ITP), 7-deaza adenosine triphosphate (c 7 ATP), 7-deaza gua- 
nosine triphosphate (c 7 GTP), and 7-deaza inosine triphosphate (c 7 ITP). 
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32. The kit of claim 28, wherein the chain-terminating nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (3'-dATP), deoxyuridine triphosphate (S'-dUTP), deoxyguanos- 
ine triphosphate (3'-dGTP), deoxycytidine triphosphate (3'-dCTP), 7-deaza 3'deoxyadenosine triphosphate (c 7 
-3'dATP), 7-deaza 3'deoxyguanosine triphosphate (c 7 -3'dGTP) and 7-deaza 3'deoxyinosine (c 7 -3'dlTP). 

5 

33. The kit of claim 28, wherein the linkage between the linking group (L) and the linking functionality (U) is selected 
from the group consisting of a photocleavable bond, a tritylether bond, a p-benzoylpropionyl group, a levulinyl 
group, a disulfide bond, an arginine/arginine bond, a lysine/lysine bond, and a pyrophosphate bond and a bond 
created by Watson-Crick base pairing. 

10 

34. The kit of claim 28, wherein the mass modified reagent is modified with a mass-modifying fu nctionality (M) according 
to one or more of the following: 

(a) a mass-modifying functionality (M) that is at a heterocyclic base of at least one nucleotide; 
is (b) a mass-modifying functionality (M) that, when incorporated into a base-specifically terminated nucleic acid 

fragment, is attached to the phosphate backbone; and 

(c) a mass-modifying functionality (M) attached to one or more sugar moieties in at least one sugar position 
selected from the group consisting of a C-2' position, an external C-3' position, and an external C-5' position. 

20 35. The kit of claim 34, wherein the heterocyclic base-modified nucleotide is selected from the group consisting of a 
cytostne nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at the 
C-5 methyl group, a uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, an adenine nucleotide 
modified at C-7, a c 7 -deazaadenine modified at C-8, a c 7 -deazaadenine modified at C-7, a guanine nucleotide 
modified at C-8, a guanine nucleotide modified at C-7, a c 7 -deazaguanine modified at C-8, a c 7 -deazaguanine 

25 modified at C-7, a hypoxanthine modified at C-8, a c 7 -deazahypoxanthine modified at C-7, and a c 7 -deazahypox- 

anthine modified at C-8. 

36. The kit of claim 28, wherein the mass modified reagent is modified with a mass-modifying functionality (M) attached 
to the sugar moiety of a B'-terminal nucleotide and wherein the mass-modifying function (M) is the linking func- 

30 tionality (L). 

37. The kit of claim 28, wherein the primer or the tag probe comprises a deoxyribonucleotide selected from the group 
consisting of: a 7-deaza deoxyadenosine triphosphate, (c 7 dA), a 7-deaza deoxyguanosine triphosphate (c 7 dG) 
and a 7-deaza deoxyinosine triphosphate (c 7 dl). 

35 

38. The kit of claim 28, wherein the primer or the tag probe comprises a ribonucleotide selected from the group con- 
sisting of: 7-deaza adenine (c 7 A), 7-deaza guanine (c 7 G) and 7-deaza inosine (c 7 I). 


39. A kit for sequencing a nucleic acid by mass spectrometry, comprising: 

40 

a) a solid support having a linking functionality (L 1 ); 

b) one or more nucleic acid primers suitable for initiating synthesis of complementary nucleic acids which are 
complementary to the nucleic acid to be sequenced, the primers each including a linking group (L) able to 
interact with the linking functionality (!_') and reversibly immobilize the primers on the solid support; N 

45 c ) chain-elongating nucleotides for synthesizing the complementary nucleic acids; and 

d) chain-terminating nucleotides for terminating synthesis of the complementary nucleic acids and generating 
sets of base-specifically terminated complementary nucleic acid fragments, 

wherein the chain-terminating nucleotides are mass-modified so that addition of one chain -terminating nu- 
50 cleotide to the complementary nucleic acid can be distinguished by mass spectrometry from addition of all other 

chain-terminating nucleotides concurrently analysed. 


40. The kit of claim 39, wherein the chain-elongating nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (dATP) deoxythymidine triphosphate (dTTP), deoxyguanosine 
triphosphate (dGTP), deoxycytidine triphosphate (dCTP), deoxyinosine triphosphate (dITP), a 7-deazadeoxygua- 
nosine triphosphate (c 7 dGTP), a 7-deazadeoxyadenosine triphosphate (c 7 dATP), and a 7-deazadeoxyinosine 
triphosphate (c 7 dITP). 


29 


10 


EP 1 262 564 A2 

41. The kit of claim 39, wherein the chain -terminating nucleotides comprise at least one nucleotide selected from the 
group consisting of dideoxyadenosine triphosphate (ddATP), dideoxythymidine triphosphate (ddTTP), dideoxy- 
guanosine triphosphate (ddGTP), dideoxycytidine triphosphate (ddCTP), 7-deazadideoxyguanosine triphosphate 
(c 7 ddGTP), 7-deazadideoxyadenosine triphosphate (Cy ddATP), 7-deazadideoxyinosine triphosphate (c 7 ddlTP). 

42. The kit of claim 39, wherein the chain-elongating nucleotides comprise a nucleotide selected from the group con- 
sisting of adenosine triphosphate (ATP), uridine triphosphate (UTP), guanosine triphosphate (GTP), cytidine tri- 
phosphate (CTP), inosine triphosphate (ITP), a 7-deazaadenosine triphosphate (c 7 ATP), a 7-deazaguanosine 
triphosphate (c 7 GTP), and a 7-deazainosine triphosphate (c 7 ITP). 

43. The kit of claim 39, wherein the chain-terminating nucleotides comprise at least one nucleotide selected from the 
group consisting of deoxyadenosine triphosphate (3'-dATP), deoxyuridine triphosphate (3'-dUTP), deoxyguanos- 
ine triphosphate (3'-dGTP), deoxycytidine triphosphate (3'-dCTP), 7-deaza 3'deoxyadenosine triphosphate (c 7 
-3'dATP), 7-deaza 3'deoxyguanosine triphosphate (c 7 -3'dGTP) and 7-deaza 3'deoxyinosine triphosphate (c 7 

is -3'dlTP). 

44. The kit of claim 39, wherein the linkage between the linking group (L) and the linking functionality (!_') is selected 
from the group consisting of a photocieavable bond, a tritylether bond, a p-benzoylpropionyl group, a levulinyl 
group, a disulfide bond, an arginine/arginine bond, a lysine/lysine bond, and a pyrophosphate bond and a bond 

20 created by Watson -Crick base pairing. 

45. The kit of claim 39, wherein the mass modified chain -terminating nucleotide is modified according to one or more 
of the following: 

25 (a) a mass-modifying functionality (M) that is at a heterocyclic base; 

(b) a mass-modifying functionality (M) that, when incorporated into a base-specifically terminated nucleic acid 
fragment, is attached to the phosphate backbone; and 

(c) a mass-modifying functionality (M) attached to one or more sugar moieties in at least one sugar position 
selected from the group consisting of a C-2' position, an external C-3' position, and an external C-5' position. 

30 

46. The kit of claim 45, wherein the heterocyclic base-modified nucleotide is selected from the group consisting of a 
cytosine nucleotide modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at the 
C-5 methyl group, a uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, an adenine nucleotide 
modified at C-7 ( a c 7 -deazaadenine modified at C-8, a c 7 -deazaadenine modified at C-7, a guanine nucleotide 

35 modified at C-8, a guanine nucleotide modified at C-7, a c 7 -deazaguanine modified at C-8, a c 7 -deazaguanine 

modified at C-7, a hypoxanthine modified at C-8, a c 7 -deazahypoxanthine modified at C-7, and a c 7 -deazahypox- 
anthine modified at C-8. 

47. An intact ionized and volatilized mass-modified nucleic acid molecule, comprising at least two mass-modified nu- 
40 cleotides, wherein the molecule is positively charged. 

48. An intact ionized mass-modified nucleic acid molecule of claim 47, comprising at least two mass-modified nucle- 
otides containing a mass-modifying functionality (M) attached to a heterocyclic base of the nucleotide. 

*5 49, An intact ionized mass-modified nucleic acid molecule of claim 47, comprising at least two mass-modified nucle- 
otides containing a mass-modifying functionality (M) attached to at least one phosphorus of the nucleotide. 

50. An ionized intact mass-modified nucleic acid molecule of claim 47, wherein a mass-modifying functionality (M) 
incorporated into the molecule is XR, wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC(S)-, 

50 -OCO(CH 2 ) r COO- (where r=1 -20), -NHCO(CH 2 ) r COO- (where r=1 -20), -OS0 2 0- and R is selected from the group 

consisting of H, methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trityl, 
aryl, substituted aryl, polyoxymethylene, monoalkylated polyoxymethylene, a polyethylene imine, -(NH(CH 2 ) r NH- 
CO(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, -(0(CH 2 ) r CO-) m - 
0-(CH 2 ) r -COOH ( -Si(Y) 3 , -(NHCHaaCOOH), -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where 

55 m is in the range of 0 to 200, Y is a lower alkyl group selected from a group consisting of methyl, ethyl, propyl, 

isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the amino acid side chain of a naturally 
occurring amino acid. 


30 


• 


EP 1 262 564 A2 

51. An intact ionized mass-modified nucleic acid molecule, comprising: 

at least one mass modified nucleotide, wherein the molecule is positively charged, and comprises a member 
selected from the group consisting of: a mass-modified universal primer and a mass-modified initiator oligo- 
5 nucleotide. 

52. An ionized mass-modified nucleic acid molecule of claim 51 , wherein a mass-modifying functionality (M) is attached 
to at least one sugar moiety of a 5'-terminal nucleotide of the primer, and wherein the mass-modifying function (M) 
is a linking functionality (L). 

10 

53. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

at least one mass-modified nucleotide containing a modified heterocyclic base selected from a group consisting 
of a cytosine moiety modified at C-5, a thymine moiety modified at C-5, a thymine moiety modified at the methyl 
15 group of C-5, a uracil moiety modified at C-5, an adenine moiety modified at C-8, a c 7 -deazaadenine moiety 

modified at C-8, a c 7 -deazaadenine moiety modified at C-7, a guanine moiety modified at C-8, a c 7 -deaza- 
guanine moiety modified at C-8, a c 7 -deazaguanine moiety modified at C-7, a hypoxanthine moiety modified 
at C-8, a c 7 -deazahypoxanthine moiety modified at C-8, and a c 7 -deazahypoxanthine moiety modified at C-7. 

20 54. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

at least one mass-modified nucleotide containing a mass-modifying functionality (M) attached to at least one 
sugar moiety of the nucleotide. 

25 55. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

a mass-modifying functionality (M) attached to at least one sugar moiety of the nucleic acid molecule, wherein 
the sugar is modified at a position selected from the group consisting of an internal C-2 1 position, an external 
C-2' position, and an external C-5 1 position, 

30 

56. An intact ionized mass-modified nucleic acid molecule, comprising at least one mass-modified nucleotide contain- 
ing a mass-modifying functionality (M) incorporated into the molecule, wherein (M) is selected from the group 
consisting of F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H5), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 . 

35 57. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

at least two mass-modified nucleotides, wherein a mass-modifying functionality (M) incorporated into the at 
least one mass-modified nucleotide is generated from a precursor functionality (PF) attached to one or more 
of a nucleic acid primer, a chain-elongating nucleoside triphosphate or a chain-terminating nucleoside triphos- 
40 phate, and wherein the precursor functionality (PF) is selected from the group consisting of -N 3 , -NH 2 , -SH, 

-NCS, -OCO(CH 2 ) r COOH (where r=1-20), -NHCO(CH 2 ) r COOH (where r=1-20), -OS0 2 OH, -OCO(CH 2 ) r l 
(where r=1-20), and -OP(0-Alkyl)N(Alkyl) 2 . 

58. An intact positively charged ionized mass-modified nucleic acid molecule, comprising: 

45 

two or more mass modified nucleotides selected from the group consisting of a mass-modified 2'-deoxynu- 
cteotide, a mass-modified 2',3'-dideoxynucleotide, a mass-modified nucleotide and a mass-modified 3'-deox- 
ynucleotide, wherein the two or more mass-modified nucleotides are different from each other. 

50 59. An ionized mass-modified nucleic acid molecule, comprising at least one mass modified nucleotide selected from 
the group consisting of a mass-modified 2'-deoxynucleotide, a mass-modified 2',3'-dideoxynucleotide, a mass- 
modified nucleotide and a mass-modified 3'-deoxynucleotide- wherein the mass modified nucleic acid molecule 
comprises a modified heterocyclic base selected from a group consisting of modified heterocyclic base is a c 7 - 
deazaadenine moiety modified at C-8, a c 7 -deazaadenine moiety modified at C-7, a c 7 -deazaguanine moiety mod- 

55 jfjed at C-8, a c 7 -deazaguanine moiety modified at C-7, a hypoxanthine moiety modified at C-8, a c 7 -deazahypox- 

anthine moiety modified at C-8, and a c 7 -deazahypoxanthine moiety modified at C-7. 

60. An ionized intact mass-modified nucleic acid molecule, comprising at least one mass modified nucleotide wherein 
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a mass-modifying functionality (M) incorporated into the molecule is generated from a precursor functionality (PF) 
attached to one or more of a nucleic acid primer, a chain-elongating nucleoside triphosphate or a chain-terminating 
nucleoside triphosphate, and wherein the precursor functionality (PF) is selected from a group consisting of -N 3 , 
-NH 2 , -SH, -NCS, -OCO(CH 2 ) r COOH (where r=1-20), -NHCO(CH 2 ) r COOH (where r=1-20), -OS0 2 OH, -OCO 
s (CH 2 ) r l (where r=1-20), -CONH 2 , -NH-C(S)-NH 2 , OP(0-Alkyl)OH p and 0-CO-CH 2 -SH. 

61 . An ionized intact mass-modified nucleic acid molecule, comprising at least two mass modified nucleotides, wherein 
a mass-modifying functionality (M) incorporated into the molecule is XR, wherein X is selected from the group 
consisting of -O-, -NH-, -S-, -NHC(S)-, -OCO(CH 2 ) r COO-(where r=1-20), -NHCO(CH 2 ) r COO- (where r=1-20), 

10 -OS0 2 0- and -OP(0-Alkyl)0- and R is selected from the group consisting of H, methyl, ethyl, propyl, isopropyl, t- 

butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substituted aryl, (-NH(CH 2 ) r NHCO 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH I (-NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, (-0(CH 2 ) r CO-) m - 
0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and 
-(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group 

'5 consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the 

amino acid side chain of a naturally occurring amino acid. 

62. An ionized intact mass-modified nucleic acid molecule, comprising at least two mass modified nucleotides, wherein 
a mass-modifying functionality (M) incorporated into the molecule is XR, wherein X is selected from the group 

20 consisting of -O-, -NH-, -S-, -NHC(S)- t OCO(CH 2 ) r COO- (where r=1-20), -NHC(O), -CONH-, -NH-C(S)-NH-, -NH- 

CO(CH 2 ) r COO- (where r=1-20), -OS0 2 0-, -OCO-CH 2 -S- and -OP(0-Alkyl)0- and R is selected from the group 
consisting of H, N 3 , methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted 
trityl, aryl, substituted aryl, (CH 2 ) m -CH 2 -OH, (CH 2 ) m -CH 2 -0-Y, (CH 2 CH 2 NH) m -CH 2 -CH 2 -NH 2 , -(NH(CH 2 ) r NHCO 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, 

25 -(NH-CHY-CO^-NH-CHY-COOH, (-0(CH 2 ) r CO-) m -0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, 

CH 2 F, CHF 2 , CF 3 , -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, 
Y is a lower alkyl group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is 
in the range of 1 to 20, and aa represents the amino acid side chain of a naturally occurring amino acid. 

30 63. A set of mass-differentiated tag probes wherein, 

each tag probe in the set comprises a sequence of nucleotides which is complementary by Watson-Crick 
base pairing to a tag sequence present within at least one set of base-specifically terminated fragments; 
the tag sequences to which each tag probe is complementary are different for each tag probe; 
each tag probe in the set comprises at least one mass-modified nucleotide; and 
35 the mass-modified nucleotides are not isotopically labeled and have different mass modifications in each tag 

probe. 

64. The set of mass-differentiated tag probes of claim 63, wherein at least one of the mass-modified nucleotides 
comprises a mass-modifying functionality (M) attached to the heterocyclic base. 

40 

65. The set of mass-differentiated tag probes of claim 64, wherein the mass-modified heterocyclic base is selected 
from the group consisting of a cytosine moiety modified at C-5, a thymine moiety modified at C-5, a thymine moiety 
modified at the C-5 methyl group, a uracil moiety modified at C-5, an adenine moiety modified at C-8, a c 7 -dea- 
zaadenine moiety modified at C-8, a, a c 7 -deazaadenine moiety modified at C-7, a guanine moiety modified at C- 

•*5 8, a c 7 -deazaguanine moiety modified at C-8, a c 7 -deazaguanine moiety modified at C-7, a hypoxanthine moiety 

modified at C-8 ( a c 7 -deazahypoxanthine moiety modified at C-8, and a c 7 -deazahypoxanthine moiety modified at 
C-7. 

66. The set of mass-differentiated tag probes of claim 63, wherein at least one of the mass-modified nucleotides 
50 comprises a mass-modifying functionality (M) attached to the phosphorus atom forming an internucleotidic linkage 

of the tag probe. 

67. The set of mass-differentiated tag probes of claim 63, wherein at least one of the mass-modified nucleotides 
comprises a mass-modifying functionality (M) attached to the sugar moiety. 

55 

68. The set of mass-differentiated tag probes of claim 63, wherein at least one of the tag probes further comprises a 
cross-linking group (CL) which allows for covalent binding to the corresponding and complementary tag sequences. 
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69. The set of mass-differentiated tag probes of claim 68, wherein the cross-linking group (CL) is activated photo- 
chemicaliy and is derived from at least one photoactivatable group selected from the group consisting of psoralen 
and an ellipticine. 

5 70. The set of mass -differentiated tag probes of claim 63, wherein at least one of the tag probes is mass-modified with 
a mass-modifying functionality (M) selected from the group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ), 
Si(CH 3 )(C 2 H 5 ) 2 , Si{C 2 H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 , wherein X is selected from the group consisting of -O-, -NH-, 
-S-, -NHC(S)- ( -OCO(CH 2 ) r COO- (where r=1-20), -NHCO(CH 2 ) r COO- (where r=1-20), -OS0 2 0-, and R is selected 
. from the group consisting of H, methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, 

10 substituted trityl, aryl, substituted aryl, polyoxymethylene, monoalkylated polyoxymethylene, a polyethylene imine, 

-(NH(CH 2 ) r NHCO(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, -(O 
(CH 2 ) r CO-) m -0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCOOH), -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and 
-(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group 
consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the 

*s amino acid side chain of a naturally occurring amino acid. 

71 . The set of mass-differentiated tag probes of claim 63, wherein one or more mass-modifying functionalities incor- 
porated into the probes are generated from a precursor functionality (PF) attached to the mass-differentiated tag 
probes, and wherein the precursor functionalities are selected from the group consisting of -N 3 , -NH 2 , -SH, -NCS, 

20 -OCO(CH 2 ) r COOH (where r=1-20), -NHCO(CH 2 ) r COOH (where r=1-20), -OS0 2 OH, -OCO(CH 2 ) r l (where r=1-20), 

-CONH 2 , -NH-C(S)-NH 2 , OP(0-Alkyl)OH, -OP(0-Aikyl)N(Alkyl) 2 , and 0-CO-CH 2 -SH. 

72. The set of mass -differentiated tag probes of claim 63, wherein the tag probes are mass-modified with a mass- 
modifying functionality (M) selected from the group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 ) 

25 (C 2 H 5 ) 2 , Si(C 2 H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 , wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC 

(S)-, -OCO(CH 2 ) r COO- (where r=1-20), -NHCO(CH 2 ) r COO- (where r=1-20), -OS0 2 0- and -OP(0-Alkyl)0- and R 
is selected from the group consisting of H, N 3 , methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, 
halogen, trityl, substituted trityl, aryl, substituted aryl, (-NH(CH 2 ) r NHCO 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, (-NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, (-0(CH 2 ) r CO-) m - 

30 0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and 

-(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group 
consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the 
amino acid side chain of a naturally occurring amino acid. 

35 73. The set of mass-differentiated tag probes of claim 63, wherein the tag probes are mass-modified with a mass- 
modifying functionality (M) selected from the group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 ) 
(C 2 H 5 ) 2 , Si(C 2 H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 , wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC 
(S)-, -OCO(CH 2 ) r COO- (where r=1-20), -NHC(O), -CONH-, -NH-C(S)-NH-, -NHCO(CH 2 ) r COO- (where r=1-20), 
-OS0 2 0-, -0-CO-CH 2 -S- and -OP(0-Alkyl)0- and R is selected from the group consisting of H, N 3 , methyl, ethyl, 

40 propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substituted aryl, 

(CH 2 ) m -CH 2 -OH, (CH 2 ) m -CH 2 -0-Y, (CH 2 CH 2 NH) m -CH 2 -CH 2 -NH 2 , -(NH(CH 2 ) r NHCO 

(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, -(NH-CHY-CO) 

m-NH-CHY-COOH, (-0(CH 2 ) r CO-) m -0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, CH 2 F, CHF 2 , 
CF 3 , -{CH 2 CH 2 0) m -CH 2 CH 2 OH 1 and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower 

45 alkyl group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range 

of 1 to 20, and aa represents the amino acid side chain of a naturally occurring amino acid. 

74. An ionized positively charged intact duplex, comprising a mass-modified tag probe bound to a tag sequence present 
within a base-specifically terminated nucleic acid fragment, wherein the mass-modified tag probe comprises at 

50 least one mass-modified nucleotide. 

75. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein the mass-modified tag probe comprises at least one mass- 
modified nucleotide selected from the group consisting of a mass-modified nucleotide comprising a mass-modifying 

55 functionality (M) attached to the heterocyclic base, a mass-modified nucleotide comprising a mass-modifying func- 

tionality (M), which, when incorporated into the tag probe, is attached to the phosphorus atom forming an internu- 
cleotidic linkage of the tag probe and a mass-modified nucleotide comprising a mass-modifying functionality (M) 
attached to the sugar moiety. 
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76. The ionized duplex of claim 75, wherein the mass-modified heterocyclic base is selected from the group consisting 
of a cytosine moiety modified at C-5, a thymine moiety modified at C-5, a thymine moiety modified at the C-5 
methyl group, a uracil moiety modified at C-5, an adenine moiety modified at C-8, a c 7 -deazaadenine moiety 
modified at C-8, a, a c 7 -deazaadenine moiety modified at C-7, a guanine moiety modified at C-8, a c 7 -deazaguanine 

5 moiety modified at C-8, a c 7 -deazaguanine moiety modified at C-7, a-hypoxanthine moiety modified at C-8, a c 7 - 

deazahypoxanthine moiety modified at C-8, and a c 7 -deazahypoxanthine moiety modified at C-7. 

77. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 

10 modified nucleotide; and the tag probe further comprises a cross-linking group (CL) which allows for covalent 

binding to the tag sequence. 

78. The ionized duplex of claim 77, wherein the cross-linking group (CL) is activated photochemically and is derived 
from at least one photoactivatabie group selected from the group consisting of psoralen and an ellipticine. 

15 

79. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and the tag probe is mass-modified with a mass-modifying functionality (M) selected from the 
group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 , 

20 wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC(S)-, -OCO(CH 2 ) r COO- (where r=1-20), 

-NHCO(CH 2 ) r COO- (where r=1-20), -OS0 2 0- and -OP(0-Alkyl)0- and R is selected from the group consisting of 
H, methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substi- 
tuted aryl, pjolyoxymethylene, monoalkylated polyoxymethylene, a polyethylene imine, -(NH(CH 2 ) r NHCO 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, -(0(CH 2 ) r CO-) m - 

25 0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCO) m -HCHaaCOOH, -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and 

-(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group 
consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range-of 1 to 20, and aa represents the 
amino acid side chain of a naturally occurring amino acid. 

30 BO. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and the tag probe is mass-modified with a mass-modifying functionality (M) selected from the 
group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 H 5 ) 3) CH 2 F, CHF 2 , and CF 3 , 
wherein X is selected from the group consisting of -0-..-NH-, -S-, -NHC(S)-, -OCO(CH 2 ) r COO- (where r=1-20), 

35 -NHCO(CH 2 ) r COO- (where r=1-20), -OS0 2 0- and -OP(0-Alkyl)0- and R is selected from the group consisting of 

H, methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substi- 
tuted aryl, (-NH(CH 2 ) r NHCO(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH I (-NH 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, (-0(CH 2 ) r CO-) m -0-(CH 2 ) r -COOH, -Si(Y) 3 , -(NHCHaaCO-) m -NHCHaaCOOH, 
-(CH 2 CH 2 0) m -CH 2 CH 2 OH, and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, where m is in the range of 0 to 200, Y is a lower alkyl 

40 group selected from a group consisting of methyl, ethyl, propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 

to 20, and aa represents the amino acid side chain of a naturally occurring amino acid. 

81. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
es modified nucleotide; and the tag probe is mass-modified with a mass-modifying functionality (M) selected from the 

group consisting of XR, F, CI, Br, I, Si(CH 3 ) 3 , Si(CH 3 ) 2 (C 2 H 5 ), Si(CH 3 )(C 2 H 5 ) 2 , Si(C 2 H 5 ) 3 , CH 2 F, CHF 2 , and CF 3 , 
wherein X is selected from the group consisting of -O-, -NH-, -S-, -NHC(S)-, -OCO(CH 2 ) r COO- (where r=1-20), 
-NHC(O), -CONH-, -NH-C(S)-NH-, -NHCO(CH 2 ) r COO- (where r=1-20), -OS0 2 0-, -0-CO-CH 2 -S- and -OP 
(O-Alkyl)O- and R is selected from the group consisting of H, N 3 , methyl, ethyl, propyl, isopropyl, t-butyl, hexyl, 

so benzyl, benzhydryl, halogen, trityl, substituted trityl, aryl, substituted aryl, (CH 2 ) m -CH 2 -OH, (CH 2 ) m -CH 2 -0-Y, 

(CH 2 CH 2 NH) m -CH 2 -CH 2 -NH 2 , -(NH(CH 2 ) r NHCO(CH 2 ) r CO-) m -NH-(CH 2 ) r -NH-CO-(CH 2 ) r -COOH, -(NH 
(CH 2 ) r CO-) m -NH-(CH 2 ) r -COOH, -(NH-CHY-CO)m-NH-CHY-COOH, (-0(CH 2 ) r CO-) m -0-(CH 2 ) r -COOH, -Si(Y) 3 , 
-(NHCHaaCO-) m -NHCHaaCOOH, CH 2 F, CHF 2 , CF 3 , -(CH 2 CH 2 0) m -CH 2 CH 2 OH, and -(CH 2 CH 2 0) m -CH 2 CH 2 0-Y, 
where m is in the range of 0 to 200, Y is a lower alkyl group selected from a group consisting of methyl, ethyl, 

55 propyl, isopropyl, t-butyl and hexyl, r is in the range of 1 to 20, and aa represents the amino acid side chain of a 

naturally occurring amino acid. 

82. An ionized intact duplex, comprising a mass-modified tag probe bound to a tag sequence present within a base- 
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specifically terminated nucleic acid fragment, wherein: the mass-modified tag probe comprises at least one mass- 
modified nucleotide; and one or more mass-modifying functionalities (M)'incorporated into the tag probe are gen- 
erated from one or more precursor functionalities (PF) attached to the tag probe, and wherein the precursor func- 
tionalities (PF) are selected from the group consisting of-N 3 , -NH 2 , -SH P -NCS, -OCO(CH 2 ) r COOH (where r=1-20), 
s -NHCO(CH 2 ) r COOH (where r=1-20), -OS0 2 OH, -OCO(CH 2 ) r l (where r=1-20), -OP(0-Alkyl)N(Alkyi) 2 , -CONH 2) 

-NH-C(S)-NH 2 , OP(0-Alky1)OH, and 0-CO-CH 2 -SH. 

83. An intact positively charged ionized and volatilized mass-modified nucleic acid molecule, comprising at least one 
mass-modified nucleotide selected from the group consisting of a mass-modified 2'-deoxynucleotide, a mass- 

10 modified 2\3 , -dideoxynucleotide and a mass-modified 3 '-deoxynucleotide. 

84. An intact ionized and volatilized mass-modified nucleic acid molecule, comprising at least two mass modified 
nucleotides. 

*5 85. A solid support, comprising a linking functionality, L\ linked to a nucleic acid primer via a linking group, L, of the 
primer to form a linkage L-L\ wherein: 

the interaction between L and L' is selectively cleavable enzymatically, chemically or physically; 
the primer, which is a primerf or enzymatic synthesis of nucleic acids, comprises a mass-modifying functionality 
(M) that introduces defined mass increments into the oligonucleotide molecule for mass-resolution by mass 
spectrometry, that is not a radiolabel or a fluorescent label, and that is linked directly to the primer, or the 
primer comprises an initiated nucleic acid chain that contains a nucleotide with a mass-modifying functionality 
(M); and 

the linkage L-L\ is selected from the group consisting of a photocleavable bond, a bond based on a strong 
electrostatic interaction, a tritylether bond, a p-benzoylpropionyl group and a levulinyl group. 

86. The solid support of claim 85, wherein: - • 

the mass-modification is a modification of a sugar moiety, base moiety or phosphate backbone; and 
30 is a modification of a nucieobase or bases in the chain or in the primer, to the phosphate backbone in the chain 

or in the primer or to a 2'-position of the nucleoside or nucleosides in the chain or in the primer. 

87. A microtiter plate adapted with a functionalized membrane, comprising a solid support and a reversibly linked 
nucleic acid primer in each well. 

35 

88. The solid support according to claim 85, wherein the photocleavable bond of linkage L-L', is selected from the 
group consisting of a charge transfer complex and a moiety, which forms a stable organic radical upon cleavage. 

89. A solid support having a linking functionality, L\ linked to a primer via a linking group, L, forming a photocleavable 
40 bond L-L", wherein the photocleavable bond is selected to be selectively cleaved by ultraviolet laser energy 

90. The solid support of claim 86, wherein the mass modifying functionality (M) is attached to a heterocyclic base of 
at least one nucleotide, thereby forming a heterocyclic base-modified nucleotide. 

45 91. The solid support of claim 85, wherein the mass modifying functionality (M) is attached to a heterocyclic base of 
at least one nucleotide, thereby forming a heterocyclic base-modified nucleotide; and 

the heterocyclic base-modified nucleotide is selected from the group consisting of a cytosine nucleotide 
modified at C-5, a thymine nucleotide modified at C-5, a thymine nucleotide modified at the C-5 methyl group, a 
uracil nucleotide modified at C-5, an adenine nucleotide modified at C-8, a c 7 -deazaadinine nucleotide modified 

50 at C-8, a c 7 -deazaadinine nucleotide modified at C-7, a guanine nucleotide modified at C-8, a c 7 -deazaguanine 

nucleotide modified at C-8, a c 7 -deazaguanine nucleotide modified at C-7, a hypoxanthine nucleotide modified at 
C-8, a c 7 -deazahypoxanthine nucleotide modified at C-7, and a c 7 -deazahypoxanthine nucleotide modified at C-8. 

92. The solid support of claim 86, wherein the mass-modifying functionality (M) is attached to one or more phosphorous 
55 atoms of an internucleotidic linkage of the primer or of the primer initiated nucleic acid chain. 

93. The solid support of claim 86, wherein the mass modifying functionality (M) is attached to one or more sugar 
moieties of nucleotides of the primer or primer initiated nucleic acid chain at least one sugar position selected from 
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the group consisting of an internal C-2' position, an external C-2' position, and an external C-5' position. 

94. The solid support of claim 86, wherein the mass-modifying functionality (M) is attached to the sugar moiety of a 5' 
terminal nucleotide and wherein the mass-modifying function (M) is the linking group (L). 

5 

95. The solid support of claim 86, comprising a set of base-specifically terminated fragments that comprise a mass 
modifying functionality, wherein the mass modifying functionality (M) is attached to the set of base-specifically 
terminated fragments subsequent to enzymatic synthesis of the base-specifically terminated fragments and prior 
to determining the molecular weight values for the fragments by mass spectrometry. 

10 

96. The solid support of claim 85 or 86, which is selected from the group consisting of; a bead, capillary, polymeric 
sheet, glass plate, and metal surface. 

97. The solid support of claim 96, wherein the bead is selected from the group consisting of: a magnetic bead, a 
*5 cellulose bead, polystyrene bead, Controlled Pore Glass (CPG) bead, silica-gef bead, a cross-linked dextran bead 

and an agarose bead. 

98. A solid support, comprising a linking functionality, L\ reversibly linked to a nucleic acid primer via a linking group, 
L, of the primer to form a linkage L-L\ wherein: 

20 

the interaction between L and L' is selectively cleavable enzymatically, chemically or physically; 
the primer, which is for enzymatic synthesis of nucleic acid molecules, comprises a mass-modifying function- 
ality (M) that introduces defined mass increments into the oligonucleotide molecule for mass-resolution by 
mass spectrometry, that is not a radiolabel and that is linked directly to the primer, or the primer comprises an 
25 initiated nucleic acid chain that contains a nucleotide with a mass-modifying functionality (M); and 

the linkage L-L\ is a photocleavable bond or a bond based on a strong electrostatic interaction. 

99. A solid support, comprising a linking functionality, L\ reversibly linked to a nucleic acid primer via a linking group, 
L, of the primer to form a linkage L-L\ wherein: 

30 

the interaction between L and L* is cleavable enzymatically, chemically or physically; and 
the primer contains a mass-modifying functionality (M) that is not a radiolabel or a fluorescent label, or the 
primer comprises an initiated nucleic acid chain that contains a nucleoside triphosphate with a mass-modifying 
functionality (M) that is not a radiolabel or a fluorescent label. 

35 

100. A method of sequencings nucleic acid, comprising: 

a) generating base-specifically terminated nucleic acid fragments from the nucleic acid to be sequenced; 

b) exposing the base-specifically terminated nucleic acid fragments to a single laser to produce desorbed/ 
40 ionized fragments; 

c) determining the molecular weight value of each desorbecl/ionized fragment produced by step (b) by mass 
spectrometry; and 

d) determining the nucleotide sequence by aligning the base-specifically terminated nucleic acid fragments 
according to molecular weight. 

45 

1 01 .A method of sequencing a nucleic acid, comprising: 

generating base-specifically terminated nucleic acid fragments from the nucleic acid to be sequenced; 
determining the molecular weight value of each base-specifically terminated fragment simultaneously by mass 
50 spectrometry; and 

determining the the nucleotide sequence by aligning the base-specifically terminated fragments according to 
molecular weight. 

102.The method of claim 100 or 101 , wherein the base-specifically terminated fragments are purified before the step 
55 ' of determining the molecular weight values by mass spectrometry. 

1 03The method of claim 1 02, wherein the base-specifically terminated fragments are purified by a method comprising; 
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immobilizing the base-specifically terminated nucleic acid fragments on a solid support; and 
washing out all remaining reactants and by-products. 

104. The method of claims 101 or 102 wherein a counter-ion of the phosphate backbone of the base-specifically ter- 
5 minated nucleic acid fragments is removed or is exchanged with a second counter-ion. 

105. The method of claim 101 or 102, wherin the molecular weight value of each fragment is determined by matrix- 
assisted laser desorption/ionization (MALDI-MS). 
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FIG.I 



MASS SPECTROMETRY 
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FIG.6 
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FIG. 9 
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FIG.I2 
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PCT Application No. 
N/A 


Filing Date 


I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 

I hereby appoint the following attorneys and agents, with full power of substitution and revocation, 
to prosecute this application and to transact all business in the United States Patent and Trademark 
Office connected therewith and request that all correspondence and telephone calls in respect to this 
application be directed to Stephanie Seidman, HELLER EHRMAN WHITE AND McAULIFFE LLP, 4350 
La Jolla Village Drive, 7th Floor, San Diego, California 92122-1246; 858-450-8400: 


Attorney 


Reg. No. 


Stephanie Seidman 
Dale L. Rieger 
David A. Hall 


33,779 
43,045 
32,233 


and other members of the firm. 


Address for correspondence: 


Stephanie Seidman 

HELLER EHRMAN WHITE & McAULIFFE LLP 
4350 La Jolla Village Drive, 7th Floor 
San Diego, California 921 22-1 246 
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Full name of joint inventor: Joseph A. Monforte 

Inventor's signature: 


Date: 

Residence: 
Citizenship: 

Full name of joint inventor: 

Inventor's signature: 
Date: 

Residence: 
Citizenship: 


50 Alamo Avenue ' 

Berkeley, California 94708 
U.S.A. ' 


Thomas A. Shaler 


3910 Springfield Common 
Fremont, California 94555 
U.S.A. 


Full name of joint inventor: 

Inventor's signature: 
Date: 

Residence: 
Citizenship: 

Full name of joint inventor: 

Inventor's signature: 
Date: 


Yupinq Tan 


34188 O'Neil Terrace 
Fremont, California 94555 
People's Republic of China 

Christopher H. Becker 


Residence: 3404 Bryant Street 


Palo Alto, California 94036 
Citizenship: U.S.A. 
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